(57) Abstract: The present invention relates to a new essential downstream component of the wingless signaling pathway. In
particular, the invention relates to nucleotide sequences of the Drosophila melanogaster daughter of legless (doll) gene, of its
encoded proteins, as well as derivatives, fragments and analogues thereof. The invention includes vertebrate and invertebrate
homologues of the Doll protein, comprising proteins that contain a stretch of amino acids with similarity to the Drosophila Doll
gene. Methods for producing the Doll protein, derivatives and analogs, e.g. by recombinant means, and antibodies to Doll are
provided by the present invention as well. The invention also relates to methods for performing high throughput screening assays
for compounds modulating Doll function in the Wnt pathway.

A NEW ESSENTIAL DOWNSTREAM COMPONENT OF THE WINGLESS SIGNALLING PATHWAY



The present invention relates to a new essential downstream
component of the wingless signaling pathway. In particular, the
invention relates to nucleotide sequences of the Drosophila
melanogaster daughter of legless (doll) gene, of its encoded
proteins, as well as derivatives, fragments and analogues thereof.
The invention includes vertebrate and invertebrate homologues of 
the Doll protein, comprising proteins that contain a stretch of 
amino acids with similarity to the Drosophila Doll gene. Methods 
for producing the Doll protein, derivatives and analogs, e.g. by 
recombinant means, and antibodies to Doll are provided by the 
present invention as well. The invention also relates to methods 
for performing high throughput screening assays for compounds 
modulating Doll function in the Wnt pathway. 

Background of the invention 

Wnt genes encode a large family of secreted, cysteine-rich
proteins that play key roles as intercellular signaling molecules
in a wide variety of biological processes (for an extensive
review see Wodarz and Nusse 1998). The first Wnt gene, mouse
wnt-1, was discovered as a proto-oncogene activated by integration
of mouse mammary tumor virus in mammary tumors (Nusse and Varmus
1982). Consequently, the involvement of the Wnt pathway in cancer
has been studied extensively. With the identification of the
Drosophila polarity gene wingless (wg) as a wnt-1 homologue
(Cabrera, Alonso et al. 1987; Perrimon and Mahowald 1987;
Rijsewijk, Schuermann et al. 1987), it became clear that wnt genes
are important developmental regulators. Thus, although at first
glance dissimilar, biological processes like embryogenesis and
carcinogenesis both rely on cell communication via identical
signaling pathways. In a current model of the pathway, the
secreted Wnt protein binds to Frizzled cell surface receptors and
activates the cytoplasmic protein Dishevelled (Dsh). Dsh then
transmits the signal to a complex of several proteins, including
the protein kinase Shaggy/GSK3 (Sgg), the scaffold protein Axin and
β-Catenin, the vertebrate homologue of Armadillo. In this complex
β-Catenin is targeted for degradation after being phosphorylated
by Sgg. After Wnt signaling and the resulting down-regulation of
Sgg activity, β-Catenin (or its Drosophila homologue Armadillo)
escapes from degradation and accumulates in the cytoplasm. Free
cytoplasmic β-Catenin translocates to the nucleus by a still
obscure mechanism, and modulates gene transcription through
binding the Tcf/Lef family of transcription factors (Grosschedl R
1999).

This setup, in which the key transducer is continuously held in
check, is highly susceptible to mutations in its inhibitory
components. The loss of any of the three elements of the β-Catenin
destruction complex leads to an increase in β-Catenin levels,
and hence to the constitutive activation of the pathway. While
this may reduce cellular viability, as upon loss of GSK-3
function, it can also lead to cell fate changes, uncontrolled
proliferation and tumorigenic behavior as in the cases of APC and
Axin (Barker N 1999; Morin 1999; Potter 1999; Roose and Clevers
1999; Waltzer and Bienz 1999). Attempts to counter these harmful
situations must aim at curbing the nuclear activities of
β-Catenin, either by preventing the formation of the β-Catenin-TCF
complex or by interfering with its transcriptional activator
function.

Currently, there are no known therapeutic agents that effectively
inhibit β-Catenin transcriptional activation. This is partly
due to the fact that many of the essential components required
for its full activation and nuclear translocation are still
unknown. Consequently, there is an urgent need to understand more
about this pathway in order to be able to develop effective drugs
against these highly malignant diseases.

In order to identify new components required for Wingless
activation the inventors used a Drosophila genetic approach to
screen for dominant suppressors of the rough eye phenotype
caused by ectopic expression of Wingless, the Drosophila
homologue of Wnt, during eye development. Three genes were
identified: the β-catenin homologue armadillo (arm), the tcf/lef-1
homologue pangolin (pan) and legless (lgs), a completely new
gene (US 09/915.543). The lgs gene was subsequently cloned and
its in vivo requirement for Wingless signal transduction in the
embryo and in developing tissues was confirmed. The presence of
Lgs is required for a transcriptionally active Arm/Pangolin
complex and over-expression of Lgs strongly stimulates the
transcriptional output of this bipartite transcription factor. The
human genome contains at least two human Lgs homologues. One of
them, Bcl9, has been previously implicated in B cell
malignancies (Willis, Zalcberg et al. 1998). It was also genetically and
biochemically demonstrated that dLgs and hLgs bind to Armadillo
and β-Catenin and are functionally required for Wnt signal
propagation in human cells. However, genetic experiments
strongly suggested the presence of a second protein which binds
to Lgs and is essential for the function of the active
β-Catenin-Pangolin-Lgs complex.

The present invention describes the cloning and functional
characterization of a novel Drosophila protein, named Daughter of
Legless (Doll), which binds to Lgs and is required for Wnt
signaling. In addition, the invention provides the sequences of the
functional and structural human and mouse homologues as well as 
methods to screen for compounds inhibiting Doll function in the 
Wnt pathway. 

Definitions 

The terms "Doll polypeptide" and "Doll protein" when used herein
encompass native invertebrate and vertebrate Doll and Doll
variant sequences (which are further defined herein).

A "wild type sequence Doll" comprises a polypeptide having the
same amino acid sequence as a Doll protein derived from nature.
Such wild type sequence of Doll can be isolated from nature or
produced by recombinant and/or synthetic means. The term "wild
type sequence Doll" specifically encompasses naturally occurring
truncated forms, naturally occurring variant forms (e.g.,
alternatively spliced forms) and naturally occurring allelic variants
of Doll. In one embodiment of the invention, the wild type Doll
sequence is a mature or full-length Doll sequence comprising
amino acids 1 to 815 of dDoll (Figure 1), or 1 to 419 of
hDoll-1, or 1 to 406 of hDoll-2 (Figure 2), or 1 to 417 of mDoll-1, or
1 to 407 of mDoll-2 (Figure 3).

"Doll variant" means an active Doll, having at least about 50%
amino acid sequence identity with the amino acid sequence of a
wild type Doll protein of Figure 1, 2 and 3. The term "Doll
variant", however, also includes functional homologues of
Doll in the Wnt pathway.

"Percent (%) amino acid sequence identity" with respect to the
Doll sequences identified herein is defined as the percentage of 
amino acid residues in a candidate sequence that are identical 
with the amino acid residues in the Doll sequence, after align- 
ing the sequence and introducing gaps, if necessary, to achieve 
the maximum percentage sequence identity, and not considering 
any conservative amino acid substitution as part of the sequence 
identity. The % identity values used herein can be generated by 
WU-BLAST-2, which was obtained from (Tatusova TA 1999) . WU- 
BLAST-2 uses several search parameters, most of which are set to 
the default values. 
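
By way of illustration only, the percent identity defined above can be computed from two sequences that have already been aligned. The following Python sketch is not WU-BLAST-2 and does not perform the alignment itself. The function name `percent_identity`, the use of '-' as the gap character, the choice of the candidate residue count as denominator and the toy sequences are assumptions made for this example.

```python
def percent_identity(aligned_candidate: str, aligned_reference: str) -> float:
    """Percent of candidate residues identical to the aligned reference residue.

    Both inputs must already be aligned to equal length, with '-' marking gaps.
    Conservative substitutions are not counted as identities.
    """
    if len(aligned_candidate) != len(aligned_reference):
        raise ValueError("aligned sequences must have equal length")
    # Gap columns in the candidate do not count as candidate residues.
    candidate_residues = sum(1 for c in aligned_candidate if c != "-")
    if candidate_residues == 0:
        raise ValueError("candidate sequence contains no residues")
    identical = sum(
        1
        for c, r in zip(aligned_candidate, aligned_reference)
        if c != "-" and c == r
    )
    return 100.0 * identical / candidate_residues


# Hypothetical toy alignment, not taken from any Doll sequence.
candidate = "MKT-LLVA"
reference = "MKTALIVA"
print(round(percent_identity(candidate, reference), 1))
```

In this toy alignment six of the seven candidate residues match the reference, so the value is 6/7 expressed as a percentage.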

The term "positive", in the context of sequence comparison
performed as described above, includes residues in the sequence
compared that are not identical but have similar properties
(e.g. as a result of a conservative substitution). The % value
of positive is determined by the fraction of residues scoring a 
positive value in the BLOSUM 62 matrix divided by the total num- 
ber of residues in the longer sequence as defined above. 
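
As a minimal sketch of the "positive" calculation just described, the Python example below counts aligned residue pairs that score positively and divides by the number of residues in the longer sequence. It assumes a pre-aligned pair with '-' as gap character. Only a small excerpt of positive off-diagonal BLOSUM 62 pairs is listed, and pairs absent from the excerpt are treated as non-positive, so a real calculation would need the full matrix. The names `POSITIVE_PAIRS`, `scores_positive` and `percent_positive` are hypothetical.

```python
# Excerpt of residue pairs with a positive off-diagonal BLOSUM 62 score.
# Illustrative only: the full matrix contains further positive pairs.
POSITIVE_PAIRS = {
    frozenset("IV"), frozenset("IL"), frozenset("LM"), frozenset("KR"),
    frozenset("DE"), frozenset("FY"), frozenset("ST"), frozenset("EQ"),
}


def scores_positive(a: str, b: str) -> bool:
    """True if the pair scores > 0 according to the excerpt above."""
    if a == b:
        # Each standard amino acid scores positively against itself.
        return True
    return frozenset((a, b)) in POSITIVE_PAIRS


def percent_positive(aligned_a: str, aligned_b: str) -> float:
    """Positively scoring pairs divided by the residues of the longer sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    longer = max(
        sum(1 for x in aligned_a if x != "-"),
        sum(1 for x in aligned_b if x != "-"),
    )
    if longer == 0:
        raise ValueError("sequences contain no residues")
    positives = sum(
        1
        for a, b in zip(aligned_a, aligned_b)
        if a != "-" and b != "-" and scores_positive(a, b)
    )
    return 100.0 * positives / longer


# Hypothetical toy alignment, not taken from any Doll sequence.
print(percent_positive("MKT-LLVA", "MKTALIVA"))
```

Here the L/I pair counts as positive although it is not an identity, which is the distinction the definition draws.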

In a similar manner, "percent (%) nucleic acid sequence identity"
with respect to the coding sequence of the Doll polypeptides
identified herein is defined as the percentage of nucleotide
residues in a candidate sequence that are identical with
the nucleotide residues in any of the Doll coding sequences of 
this invention. The identity values used herein can be generated 
using BLAST module of WU-BLAST-2 set to the default parameters. 

The term "epitope tag" refers to a chimeric polypeptide
comprising a Doll polypeptide fused to a "tag polypeptide". The tag
polypeptide has enough residues to provide an epitope against 
which an antibody can be made, yet is short enough that it does 
not interfere with the activity of the Doll polypeptide to which 
it is fused. 

Nucleic acids are "operably linked" when they are placed in a
functional relationship with another nucleic acid sequence.



WO 02/077023 PCT/CH02/00063

The term "epistasis" means hierarchy in gene action. Epistasis
experiments are performed to place components of a signaling 
pathway in the right order. 

"Rescue experiments" are designed to determine which
gene is responsible for a specific mutant phenotype. Specifi- 
cally, mutant embryos are injected with coding or genomic DNA, 
and the effect of the introduced DNA is determined on the basis 
of the capacity to revert the mutant phenotype. 

"Active" or "activity" refers to forms of Doll polypeptides that
retain the biological and/or immunological activity. A preferred 
activity includes for instance the ability to modulate the Wnt 
signaling pathway. 

The term "antagonist" is used in a broad sense, and includes any 
molecule that partially or fully inhibits, blocks or neutralizes 
a biological activity of Doll polypeptides described herein. In 
a similar way, the term "agonist" is used in the broadest sense 
and includes any molecule that mimics or supports a biological
activity of an active Doll polypeptide. 

"Treatment" refers to both therapeutic treatments and
prophylactic or preventive measures, wherein the objective is to prevent
or slow down the targeted pathologic condition or disorder. 
Those in need of treatment include those already with the disor- 
der as well as those prone to have the disorder or those in whom 
the disorder is to be prevented. 

Summary of the invention 

The present invention relates to a novel family of proteins
present in insects and vertebrate organisms, referred to
hereinafter as "Daughter of Legless (Doll)" proteins. These proteins
play an essential role in the Wnt signaling pathway, and thus in 
the formation and maintenance of spatial arrangements and pro- 
liferation of tissues during development, and in the formation 
and growth of many human tumors.

In particular, the invention relates to nucleotide sequences of
the Drosophila melanogaster doll gene, of proteins encoded by
said nucleotide sequences, as well as fragments, derivatives and
structural and functional analogs thereof.

In a preferred embodiment the invention relates to the nucleo- 
tide and protein sequences of the human and mouse doll homo- 
logues, hdoll-1, hdoll-2 and mdoll-1 and mdoll-2, respectively. 

In one embodiment, the isolated nucleic acid comprises a se- 
quence encoding a polypeptide having at least 50% amino acid se- 
quence identity, preferably at least about 70% sequence iden- 
tity, more preferably at least 90% sequence identity, even more 
preferably at least about 95% sequence identity, yet even more 
preferably at least about 98% sequence identity, and most
preferably 100% identity to (a) a fragment or the entire protein
sequence of the Doll polypeptide shown in Figure 1, or (b) the
complement of the nucleic acid molecule coding for (a).

In another preferred embodiment, the isolated nucleic acid
encodes a polypeptide having at least 50% amino acid sequence
identity, preferably about 70% sequence identity, more
preferably at least 90% sequence identity, even more preferably about
95% sequence identity, yet even more preferably about 98%
sequence identity, and most preferably 100% identity to (a) a
polypeptide which is part of or the entire human Doll polypeptide
of Figure 2a/b or (b) the complement of the nucleic acid
molecule coding for (a).

In another embodiment, the isolated nucleic acid encodes a
polypeptide sequence having at least 50% amino acid sequence
identity, preferably about 70% sequence identity, more preferably at
least 90% sequence identity, even more preferably about 95%
sequence identity, yet even more preferably about 98% sequence
identity, and most preferably 100% identity to (a) a polypeptide
which is part of or the entire mouse Doll protein of Figure 3a/b
or (b) the complement of the nucleic acid molecule coding for
(a).

In a further embodiment, the isolated nucleic acid comprises a
sequence encoding a polypeptide that has a low overall amino acid
sequence identity but shows a sequence identity of at least 60%,
preferably at least 70%, more preferably at least 80%, even more
preferably at least 90% and most preferably 100% in the
conserved domains DHD and PHD (Figure 4).

In yet another embodiment of the present invention isolated
nucleic acids encode polypeptides having a function resembling
that of the doll genes.

In another embodiment, the invention relates to a fragment of
the Drosophila or vertebrate doll nucleic acid sequences that is
used as a hybridization probe. Such nucleic acid fragments are
about 20 to about 100 nucleotides in length, preferably from
about 20 to about 60 nucleotides in length, most preferably from
20 to 50 nucleotides in length and are derived from the
nucleotide sequences shown in Figure 1, 2 and 3.
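
Purely as an illustration of how fragments in the stated length range could be enumerated from a longer nucleotide sequence, a short Python sketch follows. It only lists candidate windows and their reverse complements and does not evaluate hybridization specificity or melting temperature. The function names and the 24-nucleotide input are hypothetical and are not taken from Figures 1 to 3.

```python
from typing import Iterator, Tuple

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def candidate_probes(
    seq: str, min_len: int = 20, max_len: int = 50
) -> Iterator[Tuple[int, str]]:
    """Yield (start, fragment) for every fragment of min_len..max_len nucleotides."""
    seq = seq.upper()
    for length in range(min_len, max_len + 1):
        # An empty range results when the sequence is shorter than `length`.
        for start in range(len(seq) - length + 1):
            yield start, seq[start:start + length]


# Hypothetical 24-nt sequence used only to exercise the functions.
example = "ATGGCTAAGCTTGACCGTTAGCAA"
probes = list(candidate_probes(example, min_len=20, max_len=22))
print(len(probes))                       # number of candidate fragments
print(reverse_complement(probes[0][1]))  # antisense form of the first fragment
```

The default bounds of 20 and 50 nucleotides mirror the most preferred range stated above; the broader ranges can be obtained by changing `max_len`.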

The invention further provides eucaryotic and procaryotic ex- 
pression vectors comprising a nucleic acid molecule encoding 
Drosophila or vertebrate doll or a fragment thereof as shown in 
Figures 1, 2 and 3. The vector can comprise any of the molecules 
or fragments thereof described above. 

The invention also includes host cells comprising such a vector. 
By way of example, the host cells can be mammalian cells, yeast 
cells, insect cells, plant cells or bacterial cells.

Methods of production, isolation and purification of the Doll 
proteins, derivatives and analogs, e.g. by recombinant means, 
are also provided (see Example VI, below) . In a specific aspect, 
the invention concerns an isolated Doll peptide, comprising an 
amino acid sequence of at least 80%, preferably at least about 
85% sequence identity, more preferably at least 90% sequence 
identity, even more preferably at least 95% sequence identity, 
yet most preferably 100% identity with the amino acid sequences 
of Figures 1, 2 and 3. 

In yet another embodiment the invention relates to chimeric
proteins comprising a Doll polypeptide fused to a heterologous
polypeptide or amino acid sequence. An example of such a chimeric
molecule comprises a Doll polypeptide fused to an epitope tagged
sequence, glutathione-S-transferase protein or to a protein with
an enzymatic activity, such as beta-galactosidase or alkaline
phosphatase as described in Example VI below.

In a further aspect the invention concerns an isolated full 
length Doll polypeptide (prepared as described in Example VI) , 
comprising the amino acid sequences of Figure 1, 2 and 3, or any 
Doll polypeptide or a fragment thereof described in this
invention sufficient to provide a binding site for an anti-Doll
antibody.

In another embodiment the invention provides antibodies that 
specifically recognize Doll polypeptides. The antibodies can be 
a polyclonal or a monoclonal preparation or fragments thereof.
Polyclonal antibodies are prepared by immunization of rabbits 
with purified Doll polypeptides prepared as described in Example 
VI. 

The invention also relates to transgenic animals, e.g. Droso- 
phila, mice, rats, chicken, frogs, pigs or sheep, having a 
transgene, e.g., animals that include and preferably express a 
heterologous form of the Doll genes described herein, or that 
misexpress an endogenous or transgenic doll gene. Such a trans- 
genic animal can serve as a model for studying diseases with 
disrupted Wnt signaling pathway, for the production of Doll pro- 
teins, or for drug screening. 

In yet another embodiment, the invention also features animals, 
e.g. Drosophila, mice, rats, chicken, frogs, pigs or sheep,
having a mutation in the doll gene, e.g. deletions, point
mutations, foreign DNA insertions or inversions. Such animals can
serve to study diseases characterized by disrupted Wnt function 
or in drug screening. 

In addition, the invention relates to the use of Doll proteins, 
homologues, derivatives and fragments thereof as well as nucleic 
acids, derivatives and fragments thereof in therapeutic and di- 
agnostic methods and compounds. In particular, the invention 
provides methods and compounds for treatment of disorders of 
cell fate, differentiation or proliferation by administration of 
a therapeutic compound of the invention. Such therapeutic
compounds include: Drosophila and vertebrate Doll protein
homologues or fragments thereof, antibodies or antibody fragments
thereto, doll antisense DNA or RNA, doll double stranded RNA,
and any chemical or naturally occurring compound interfering with
Doll function, synthesis or degradation. In particular, the
invention provides methods to screen for chemical compounds,
organic products or peptides interfering with Doll function in the
Wnt pathway. In a preferred embodiment the screening method will 
be a cellular reporter gene assay or a protein-protein interac- 
tion assay. 

In another embodiment, a screening assay based on
protein-protein interaction is used to screen for compounds specifically
inhibiting the Doll-Lgs or Doll-interaction partner X interaction.

The invention also provides methods to screen for chemical com- 
pounds, organic products or peptides interfering with Doll func- 
tion in the Wnt pathway. 

Furthermore, the invention comprises the use of the DHD domain 
in screening assays such as an in vitro protein-protein interac- 
tion assay or a protein-protein interaction in a host cell. Said 
assays are applied for the identification of chemical compounds, 
organic products, polypeptides or peptides interfering with Doll 
function in the Wnt pathway. 

In a preferred embodiment, a therapeutic product according to 
the invention is administered to treat a cancerous condition or 
to prevent progression from a pre-neoplastic or non-malignant
condition to a neoplastic or malignant state. 

In other specific embodiments, a therapeutic product of the in- 
vention is administered to treat a blood disease or to promote 
tissue regeneration and repair. Finally, disorders of cell fate,
especially hyperproliferative or hypoproliferative disorders,
involving aberrant or undesirable expression, or localization, 
or activity of the Doll protein can be diagnosed by detecting 
such levels. 

Brief description of the drawings

Figure 1 The Drosophila doll cDNA and protein sequence.

Figure 2 The human doll-1 and doll-2 cDNA and protein sequence.

Figure 3 The mouse doll-1 and doll-2 cDNA and protein sequences.

Figure 4. Drosophila and human Doll proteins contain a PHD
finger motif with which they bind to the HD1 of Lgs/BCL9

(A) Top: Schematic representation of Doll, human DOLL-1 and hu- 
man DOLL-2. The two domains that show high sequence similarities 
are highlighted in dark gray (DHD: Doll homology domain) and red 
(PHD: plant homology domain) . The DHD appears to be unique, as 
the inventors failed to find a similar sequence in other Droso- 
phila or human proteins. GenBank accession numbers for Doll, 
hDoll-1, hDoll-2 are AF457206, AF457207, AF457208, respectively. 
Bottom: Multiple alignment of Drosophila, human and mouse Doll 
protein sequences. 

(B) Alignment of amino acid sequences of DHD and PHD in Doll and 
its human homologues. 

Similarities are boxed, identities shaded in gray. The numbers 
to the left indicate the positions of DHD and PHD within their 
respective protein sequences. For the DHD alignment a gap of 22 
aa has been introduced in the Drosophila DHD (represented as 
(X)5) . 

(C) Mapping of the Lgs/BCL9 interaction site in Doll. Schematic
representation of the proteins tested in the yeast-two-hybrid
assay for their interactions with Lgs and BCL9. Results are
indicated to the right ("bdg").

(D) Mapping of the Doll interaction site in dLgs and hLgs/BCL9.
Schematic representation of the proteins tested in the
yeast-two-hybrid assay for their interactions with Lgs. The two
proteins shown at the bottom were tested by a pull-down assay for
both dLgs (numbers without brackets) and hLgs/BCL9 (numbers in
parentheses) with the same result ("bdg"). The deletion removing
HD1 comprises aa 318-345 for Lgs and aa 177-204 for hLgs/BCL9.
Fusion proteins used were S-Tag-dDoll (aa 542-815) and
GST-hDOLL-2 (aa 301-406).

Figure 5. doll is a segment polarity gene required for Wg
signalling

(A-C) Cuticle preparations of larvae derived from wild-type (A),
wg mutant (B), and doll mutant embryos (C). The doll 130/doll 130
embryo in (C) is derived from a homozygous doll 130 mutant germ
line clone (see Experimental Procedures) and displays a wg-like
phenotype.

(D,E) doll functions downstream of dAPC2. Two cuticle
preparations are shown from larvae that developed in the absence of
maternal and zygotic wild-type dAPC2 function (McCartney et al.,
1999). The embryo in (E) additionally lacks the maternal and
zygotic function of doll (see Examples). In contrast to dAPC2
single mutant animals, which have strongly reduced denticle belts,
double mutants display a doll-like phenotype.

(F-I) Confocal images of third instar wing imaginal disc
preparations stained with antibodies against Dll (F,G) and Ptc
(H,I). Wild-type animals show normal expression of these genes
(F,H). Discs derived from doll 130 mutant larvae are small, yet
express Ptc (I), but fail to express Dll (G). Lack of Dll
expression may be an indirect consequence of the earlier
wing-to-notum transformation in doll 130 larvae. However, we also see a
strong reduction of Dll expression in doll 130 mutant cells from
mosaic animals (not shown).

Figure 6. Lgs and the PHD finger of Doll serve to assemble Doll 
and Arm 

Schematic representation of Lgs (yellow) and Doll (light green)
constructs that were used in transgene assays to assess their
ability to rescue lgs or doll mutant animals. 1: Full-length Lgs
(pOP216, aa 1-1464). 2: C-terminally truncated Lgs (pTK131, aa
1-583). 3: HD1-Galll-HD2 (pTK153, aa 268-395 (HD1), aa 369-500
(Galll), aa 465-596 (HD2)). 4: HD1-(HA)3-HD2 (pTK143, aa 268-395
(HD1), aa 465-596 (HD2)). 5: Full-length Doll (pTK56, aa 1-815).
6: Doll[ΔPHD]-HD2 (pTK135, aa 1-740 (Doll[ΔPHD]), aa 483-561
(HD2)). Transgenes 1 to 4 are able to rescue lgs 20F homozygotes.
An example for an adult animal rescued by transgene 3 is shown on
the right. Transgene 5 can rescue doll 130 homozygotes. Transgene 6
can rescue doll 130 as well as lgs 20F homozygotes (photographs on
the right).

Figure 7: Rescue of ddoll-/- flies by expression of a human doll 
transgene. The lethality caused by the doll 130 / EP(3)1076
genotype can be fully rescued by a tubulin 1 promoter-driven
transgene that contains either the coding region of the Drosophila
doll gene (not shown) or that of one of its two human homologues 
hdoll-1 and hdoll-2. 

Figure 8: Effects of human Doll 1 and 2 on Tcf transcription.
293 cells were transiently transfected with the pTOPFLASH or
pFOPFLASH luciferase reporters and different effector plasmids
as indicated. A constitutively active form of β-Catenin
(ΔN-β-Catenin, 50 ng) or human Doll-1 or hDoll-2 (350 ng) activates
the pTOPFLASH reporter. Cotransfection of human Doll with
ΔN-β-Catenin strongly enhances the response.

Detailed description of the invention 

The Wnt signaling cascade is essential for the development of
both invertebrates and vertebrates, and has been implicated in
tumorigenesis. The Drosophila wg gene is one of the best
characterized within the Wnt family, which includes more
than a hundred genes. In the Drosophila embryo, wg is required for
formation of parasegment boundaries and for maintenance of
engrailed (en) expression in adjacent cells. The epidermis of
embryos defective in wg function shows only a rudimentary
segmentation, which is reflected in an abnormal cuticle pattern. While
the ventral cuticle of wild type larvae displays denticle belts
alternating with naked regions, the cuticle of wg mutant larvae
is completely covered with denticles. During imaginal disc
development, wg controls dorso-ventral positional information. In
the leg disc, wg patterns the future leg by the induction of
ventral fate (Struhl and Basler 1993). In animals with reduced wg
activity, the ventral half of the leg develops into a mirror
image of the dorsal side (Baker 1988). Accordingly, reduced wg
activity leads to the transformation of wing to notal tissue,
hence the name of the gene (Sharma and Chopra 1976). In the eye
disc, wg suppresses ommatidial differentiation in favor of head
cuticle development, and is involved in establishing the
dorso-ventral axis across the eye field (Heberlein, Borod et al.
1998).

Additional genes have been implicated in the secretion,
reception or interpretation of the Wg signal. For instance,
genetic studies in Drosophila revealed the involvement of frizzled
(Dfz), Dishevelled (dsh), shaggy/zeste-white-3 (sgg/zw-3),
armadillo (arm), adenomatous polyposis coli (E-apc), axin, and
pangolin (pan) in Wg signaling. A genetic order of these
transducers has been established in which Wg acts through Dsh to
inhibit Sgg, thus relieving the repression of Arm by Sgg,
resulting in the cytoplasmic accumulation of Arm and its translocation
to the nucleus. In the nucleus Arm interacts with Pan to
activate transcription of target genes. Vertebrate homologues have
been identified for all these components (for an updated review
see Peifer and Polakis 2000), suggesting that newly identified
members of the Drosophila signaling pathway are likely to have
vertebrate counterparts.
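
The genetic order described above (Wg acts through Dsh to inhibit Sgg, which otherwise represses Arm, which in turn acts with Pan) can be read as a simple chain of Boolean conditions. The following Python sketch is a toy model written only to illustrate how such an order predicts the outcome of epistasis experiments. It is an assumption-laden simplification: it ignores dosage, kinetics and components such as Axin and APC, and the function and argument names are hypothetical.

```python
def wg_targets_on(
    wg: bool = True,
    dsh: bool = True,
    sgg: bool = True,
    arm: bool = True,
    pan: bool = True,
) -> bool:
    """Return True if Wg target genes are transcribed in this toy model.

    Each argument states whether the component is present and functional.
    """
    dsh_active = wg and dsh                # Wg acts through Dsh
    sgg_active = sgg and not dsh_active    # active Dsh inhibits Sgg
    arm_stable = arm and not sgg_active    # active Sgg represses Arm
    return arm_stable and pan              # nuclear Arm needs Pan


print(wg_targets_on())                                # signal present
print(wg_targets_on(wg=False))                        # no signal
print(wg_targets_on(wg=False, sgg=False))             # loss of Sgg
print(wg_targets_on(wg=False, sgg=False, arm=False))  # double mutant
```

In this model loss of Sgg switches the targets on even without Wg, whereas additionally removing Arm switches them off again. The downstream component therefore determines the double-mutant outcome, which is the reasoning used to order components in epistasis experiments.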

Mutations leading to nuclear accumulation of the mammalian
homologue of Arm, β-Catenin, and consequently to constitutive
activation of the Wnt pathway have been observed in many types of
cancer, including colon cancer, breast cancer, melanoma,
hepatocellular carcinoma, ovarian cancer, endometrial cancer,
medulloblastoma, pilomatricomas, and prostate cancer (Morin 1999;
Polakis, Hart et al. 1999). It is now apparent that deregulation
of β-Catenin signaling is an important event in the genesis of
these malignancies. However, there are still no known
therapeutic agents effectively inhibiting β-Catenin transcriptional
activation. This is partly due to the fact that many of the
essential components required for its full activation and nuclear
translocation are still unknown.

In order to identify new components required for Wingless
activation the inventors used a Drosophila genetic approach to
screen for dominant suppressors of the rough eye phenotype
caused by ectopic expression of Wingless (Wg), the Drosophila
homologue of Wnt, during eye development. A new gene, legless
(lgs, US 09/915.543), was identified as a strong dominant
suppressor of the rough eye phenotype. The gene was subsequently cloned
and its in vivo requirement for Wg signal transduction in the embryo
and in developing tissues was confirmed. The human genome
contains at least two human Lgs homologues, hLgs/Bcl9 and hLgs-1.
dLgs and hLgs bind to Armadillo and β-Catenin and are
functionally required for Wnt signal propagation in invertebrate and
vertebrate cells (US 09/915.543). In particular, the presence of
Lgs is required for a transcriptionally active Arm/Pangolin
complex and over-expression of Lgs strongly stimulates the
transcriptional output.

The inventors later made the interesting observation that a
mutant form of Lgs protein from which the β-Catenin interacting
domain was deleted exhibited a strong dominant-negative effect
on Wg-dependent patterning processes when expressed from a
transgene in wild-type larvae (data not shown). This strongly
suggested that Lgs normally interacts not only with Arm, but
also with at least one additional component. In an effort to
identify such components, yeast-two-hybrid screens for
interacting proteins were carried out. In two independent screens in
which either the entire protein or the N-terminal half of Lgs
was used as a bait, a novel PHD finger protein, referred to as
Daughter-of-Legless (Doll), was identified as a Lgs binding
protein (Figure 4). The 815 amino acid residue Doll protein carries
a C-terminal domain of 60 amino acids (Figure 4a), which shows
extensive homologies to the PHD (plant homology domain) finger,
also known as LAP (leukemia associated protein) domain (Aasland,
Gibson et al. 1995). This domain comprises a cysteine-rich
Zn-binding motif that has been associated with proteins involved
in chromatin-mediated regulation of transcription. The PHD
finger of Doll is necessary and sufficient to mediate the
interaction with Lgs (Figure 4c-d). The inventors also demonstrate herein
that this interaction is essential for Doll function.

The region of Lgs responsible for Doll-binding was mapped to the
HD1 sequence (Figure 4d). Moreover, two human homologues of the
Drosophila doll gene were identified and isolated (Figure 4a).
The protein products of both human genes, hDOLL-1 and hDOLL-2,
as well as their mouse homologues possess a highly conserved PHD
finger which interacts with the HD1 of hLgs/BCL9 (Figure 4d).
The only other domain in Drosophila Doll, hDoll-1/hDoll-2 and
mDoll-1/mDoll-2 that shows significant sequence homology is a 50
amino acid stretch in the N-terminal region, which is referred
to herein as 'Doll homology domain' (DHD, Figure 4a, b).

The interaction with Doll appears to be relevant for the in vivo
function of Lgs, since a mutant form of Lgs with a deletion of
HD1 was unable to rescue lgs mutant animals. The physical
association of Doll and Lgs suggested that Doll, like Lgs, may
be required for Wg signaling in vivo. To explore this
hypothesis, a proprietary collection of suppressors of the
sev-wg phenotype was searched for mutations that map to the tip
of the right arm of chromosome 3, the position of the doll gene.
One such suppressor, Sup^130, mapped to this position and,
intriguingly, showed dominant lethality in combination with the
lgs allele lgs^17B (US09/915.543) (Sup^130/+ lgs^17B/+
transheterozygous animals do not survive). The doll coding
region was sequenced using genomic DNA from homozygous Sup^130
mutant larvae, and a 14 bp deletion causing a frame shift after
amino acid position 751 was identified. Hence this allele is
referred to as doll^130 and encodes a truncated Doll protein
lacking the C-terminal PHD finger.

The lethality caused by the homozygous doll^130 genotype can be
fully rescued by a tubulin α1 promoter-driven transgene that
contains either the coding region of the Drosophila doll gene or
that of one of its two human homologues, hDoll-1 and hDoll-2
(Figure 7). Thus, the vertebrate homologues of doll were
confirmed genetically to be true functional homologues of Doll,
and hence the vertebrate homologues are part of this invention.

To assay the possible role of Doll in Wg signal transduction
during development, embryos homozygous for the doll^130 mutation
that derived from female germ cells equally mutant for doll were
generated. doll mRNA is maternally contributed and is strongly
and ubiquitously expressed during all developmental stages.
Consequently, only embryos lacking both zygotic and maternal
doll are characterized by a severe segment polarity phenotype
(Figure 5A-C), while weaker loss-of-function doll mutants
display pupal lethality with a partial or complete loss of the
antennae and the legs. Mutant individuals lacking only zygotic
function survive until early pupal stages and exhibit imaginal
discs that are abnormally small. The Hh target gene ptc was
expressed at wild-type levels in these discs; however, no
expression of the Wg target Dll could be detected (Figure 5F-I).
These discs appear to lack the presumptive wing blade field and
possess two primordia for the notum (Figures 5G and I). The fact
that similar phenotypes are caused by loss of function of wg,
dsh, arm, or lgs confirms the essential role of doll in the Wg
signaling pathway.

To address the role of Doll in β-Catenin-mediated transcription,
a TCF reporter gene (TOPFLASH; Morin, Sparks et al. 1997) was
used in immortalized human embryonic kidney cells (HEK 293
cells). Low levels of a stable mutant form of β-Catenin
(ΔN-β-catenin; van de Wetering, Cavallo et al. 1997) were
introduced into these cells to partially stimulate the pathway.
The additional expression of hDoll-1 (Figure 8) or hDoll-2 (not
shown) led to a large increase in luciferase activity (30-fold).
These levels are significantly higher than the sum of those
produced by either treatment alone (Figure 8). This potentiation
of β-Catenin activity by hDoll-1 and hDoll-2 appears to be
mediated by the interaction of endogenous TCF protein with its
DNA target sites, as it is only observed with TOPFLASH, which
contains five optimal TCF binding sites, but not with the
control reporter FOPFLASH, which contains five mutated sites
(Morin, Sparks et al. 1997). Thus this experiment adds
supportive evidence to the notion that Doll proteins transduce
Wnt signals by activating TCF target genes in a
β-Catenin-dependent manner.
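
The comparison made above, that the combined signal exceeds the
sum of the signals from either treatment alone, can be written
out as a small calculation. The following Python sketch is
illustrative only: the function names and all numeric readings
are hypothetical and are not data from Figure 8; only the
30-fold combined induction echoes the text.

```python
def fold_induction(signal: float, baseline: float) -> float:
    """Reporter activity expressed relative to the unstimulated baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return signal / baseline


def is_supra_additive(baseline: float, treatment_a: float,
                      treatment_b: float, combined: float) -> bool:
    """True if the combined gain over baseline exceeds the summed single gains."""
    gain_a = treatment_a - baseline
    gain_b = treatment_b - baseline
    return (combined - baseline) > (gain_a + gain_b)


# Hypothetical normalized TOPFLASH readings (arbitrary units), not measured data.
readings = {"baseline": 1.0, "beta_catenin_only": 3.0,
            "doll_only": 1.5, "combined": 30.0}

combined_fold = fold_induction(readings["combined"], readings["baseline"])
synergy = is_supra_additive(readings["baseline"], readings["beta_catenin_only"],
                            readings["doll_only"], readings["combined"])
print(f"combined fold induction: {combined_fold:.1f}, supra-additive: {synergy}")
```

Comparing gains over baseline, rather than raw readings, avoids
counting the baseline signal twice when the two single
treatments are summed.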

In summary, the protein-protein interactions demonstrated
between Drosophila Doll and Lgs and those between their human
homologues, human Doll and hLgs/Bcl9, in conjunction with the
genetic and cell biological data show that Doll proteins are
positive regulators of the Wg and Wnt signaling pathways,
respectively.

EXAMPLES

Example I: Isolation of doll cDNA 

The cDNA for Daughter of Legless (Doll) was isolated in two
independent yeast genetic screens of a Drosophila cDNA library
for proteins directly binding to Lgs. Other DNA libraries can be
used as well, such as genomic and cDNA libraries from vertebrate
and invertebrate organisms. Methods other than a yeast
two-hybrid screen can be used as well. Such methods include, but
are not limited to, direct amplification using gene-specific
primers and standard methods known to people skilled in the art.

To perform the yeast two-hybrid screening for proteins binding
to Lgs, cDNA sequences encoding the first 732 amino acids
("LgsN") and the full-length protein of 1464 amino acids
("LgsFL") were subcloned into a yeast expression vector (pLexA,
Clontech), fusing them to the LexA DNA-binding domain.
Subsequently these constructs were transformed into the
LEU2-reporter yeast strain EGY48 together with the lacZ-reporter
plasmid pSH18-34 and an embryonic Drosophila melanogaster cDNA
library fused to an acidic transcriptional activation domain
("RFLY-1" library, PNAS 93, 3011-3015). In a first step,
triple-transformant colonies containing the LgsN- or
LgsFL-LexA fusion constructs, respectively, the pSH18-34
reporter and a RFLY-1 library plasmid were grown on minimal
selective medium plates for two days, harvested, thoroughly
mixed, and stored as uniform aliquots. Then cells from one of
these aliquots were transferred into permissive
galactose/raffinose minimal selective liquid medium and
incubated with shaking at 30°C for a few hours, thereby inducing
expression of the library cDNA-activation domain fusion from the
GAL1-inducible promoter. Finally these "induced" cells were
plated on galactose/raffinose minimal selective medium plates
lacking the amino acid L-leucine. On these plates cell growth
was sustained only upon activation of a LEU2 selector gene
through molecular interaction of the respective LexA fusion and
activation domain fusion proteins. The LEU2 gene codes for an
essential metabolic
enzyme needed for the biosynthesis of leucine from other amino
acid precursors. All clones growing under these restrictive
conditions were isolated and analyzed for the activity of the
lacZ reporter gene, encoding the metabolic enzyme
β-galactosidase from the enterobacterium E. coli, by a standard
X-Gal assay (e.g. Bartel and Fields (eds.), Oxford University
Press 1997). From all candidate clones that passed these two
selection steps, the cDNA library plasmids were isolated again
by standard techniques (e.g. Methods in Yeast Genetics, Cold
Spring Harbor Laboratory Press, 1997) and retested for specific
interaction with Lgs in the X-Gal assay, using an unrelated LexA
fusion protein as a negative control.

By this procedure three independent cDNA clones were identified
that strongly and specifically interacted only with Lgs and
contained partially overlapping sequences: BK12b, BK14b and
TK5.35h. By searching the Drosophila genome database using the
blastn algorithm (http://www.ncbi.nlm.nih.gov:80/BLAST/) we
mapped the three isolated cDNAs to the CG11518 locus, coding for
a protein product of 815 amino acids in length. The cDNA clones
coded for different parts of the Doll protein, with BK12b
containing nucleotides (nt) 2223-2448, BK14b nt 2191-2448 and
TK4.35h nt 749-2448 of the computationally predicted open
reading frame (ORF). Further bioinformatic analysis
(http://www.ebi.ac.uk/interpro/) revealed that the very
C-terminal part of the protein sequence (ca. aa 745-805),
present in all three of the Lgs-binding clones, was predicted to
adopt a PHD-finger fold, which has been identified in other
proteins involved in transcriptional regulation at different
levels.
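
The statement that all three clones contain the predicted PHD
finger can be checked by converting the nucleotide boundaries
given above into codon numbers. The sketch below is a minimal
illustration in Python; it assumes that nucleotide 1 is the
first base of the start codon of the predicted ORF, and the
helper names are hypothetical.

```python
# Clone boundaries (1-based nucleotide positions in the predicted ORF) and the
# approximate PHD finger residues are taken from the text above.
CLONES_NT = {
    "BK12b": (2223, 2448),
    "BK14b": (2191, 2448),
    "TK4.35h": (749, 2448),
}
PHD_FINGER_AA = (745, 805)


def first_full_codon(nt_start: int) -> int:
    """First codon lying entirely at or after nucleotide nt_start."""
    return (nt_start + 1) // 3 + 1


def last_full_codon(nt_end: int) -> int:
    """Last codon lying entirely at or before nucleotide nt_end."""
    return nt_end // 3


def covers(clone_nt: tuple[int, int], region_aa: tuple[int, int]) -> bool:
    """True if every codon of region_aa is fully contained in the clone."""
    start_aa = first_full_codon(clone_nt[0])
    end_aa = last_full_codon(clone_nt[1])
    return start_aa <= region_aa[0] and region_aa[1] <= end_aa


coverage = {name: covers(span, PHD_FINGER_AA) for name, span in CLONES_NT.items()}
for name, span in CLONES_NT.items():
    print(name, first_full_codon(span[0]), last_full_codon(span[1]), coverage[name])
```

Under this numbering, nucleotide 2448 closes codon 816, which is
consistent with an 815-residue protein followed by a stop codon.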

Example II: Identification of Human and Mouse Homologues of
Drosophila Doll

After the identification of the Drosophila Doll amino acid
sequence, publicly available databases were searched for similar
protein sequences in other species, using the tblastn algorithm
(http://www.ncbi.nlm.nih.gov:80/blast/). Two candidate sequences
each were found in ESTs from Mus musculus and Homo sapiens, the
putative protein products of which display high similarity to
Doll in their C-terminal domains. These stretches of high
similarity are predicted to adopt a PHD-finger fold as well
(Figure 4). Doll proteins do not display other known structural
motifs in their N-terminal sequences, but they display a second
high homology domain, which was accordingly named DOLL Homology
Domain (DHD) (Figure 4). Doll proteins of both invertebrate and
vertebrate origin have so far not been further described or
experimentally studied, and have thus not previously been
implicated in any specific biological process.
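
Similarity between conserved domains, as discussed above, is
commonly summarized as percent identity over an alignment. The
following Python sketch shows one simple way to compute it. The
function name is hypothetical, the two aligned strings are
invented placeholders and not Doll sequences, and skipping
gapped columns is only one of several conventions in use.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identical residues over aligned columns that contain no gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" or res_b == "-":
            continue  # gapped columns are skipped under this convention
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Invented toy alignment for illustration only.
toy_a = "AC-GT"
toy_b = "ACTGA"
toy_identity = percent_identity(toy_a, toy_b)
print(f"{toy_identity:.1f}% identity over ungapped columns")
```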

Example III: Isolation and mapping of Drosophila doll alleles

EMS-treated males were crossed to females carrying a wg
transgene (sev-wg) driven by two copies of the sevenless
enhancer (Basler, Christen et al. 1991). 2 x 10^5 progeny were
screened for suppressors of the rough eye phenotype. Third
chromosomal suppressors were coarsely mapped by meiotic
recombination using a panel of P[y+] insertions. One such
suppressor, Sup^130, intriguingly showed dominant lethality in
combination with the lgs allele lgs^17B (US09/915.543)
(Sup^130/+ lgs^17B/+ transheterozygous animals do not survive),
strongly suggesting a close genetic interaction. Fine mapping of
the mutation using denaturing HPLC (WAVE system, Transgenomic
Inc.) demonstrated that it localizes within the doll gene. The
doll coding region was therefore sequenced using PCR fragments
derived from genomic DNA from homozygous Sup^130 mutant larvae.
The defect in Sup^130 was found to be a 14 bp deletion
(nucleotides 2253 to 2266: 5' CATGTGCCACAAGG 3') within the doll
open reading frame that induced a frame shift subsequent to
amino acid 751 and resulted in the formation of a premature stop
codon. Hence this allele is referred to as doll^130 and encodes
a truncated Doll protein lacking the C-terminal PHD finger.
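
The relationship between the deleted nucleotides and the frame
shift described above can be verified with simple arithmetic. The
Python sketch below is illustrative; it assumes that nucleotide 1
is the first base of the start codon of the doll open reading
frame, and the function names are hypothetical.

```python
DELETED_SEQ = "CATGTGCCACAAGG"         # deleted bases, as given in the text
DEL_START_NT, DEL_END_NT = 2253, 2266  # 1-based ORF coordinates from the text


def codon_and_position(nt: int) -> tuple[int, int]:
    """Map a 1-based ORF nucleotide to (codon number, position 1-3 in codon)."""
    return (nt - 1) // 3 + 1, (nt - 1) % 3 + 1


def causes_frameshift(deletion_length: int) -> bool:
    """A deletion shifts the reading frame unless its length is a multiple of 3."""
    return deletion_length % 3 != 0


deletion_length = DEL_END_NT - DEL_START_NT + 1
first_codon, position = codon_and_position(DEL_START_NT)
print(deletion_length, len(DELETED_SEQ), first_codon, position,
      causes_frameshift(deletion_length))
```

Under this numbering the deletion begins at the third position
of codon 751, and 14 is not a multiple of three, in line with
the frame shift after amino acid 751 described above.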

Pole cell transplantation, chromosome squashes, and chromosome 
in situ hybridization experiments were carried out according to 
standard protocols (Ashburner 1989).



Example IV: Use of doll as a hybridization probe

The following method describes the use of a non-repetitive
nucleotide sequence of doll as a hybridization probe. The method
can be applied to screen for doll homologues in other organisms
as well. DNA comprising the sequence of doll (as shown in
Figures 1, 2, 3) is employed as a probe to screen for homologous
DNAs (such as those included in cDNA or genomic libraries).
Hybridization and washing of the filters containing either type
of library DNA is performed under standard high stringency
conditions (Sambrook, Fritsch et al. 1989). Positive clones can
be used to further screen larger cDNA library platings.
Representative cDNA clones are subsequently cloned into
pBluescript (Stratagene) or similar cloning vectors and
sequenced.

Example V: Use of doll as a hybridization probe for in situ hy- 
bridization. 

In situ hybridization of Drosophila doll mRNA can be performed
in embryos as described in (Tautz and Pfeifle 1989). With small
modifications, the method can also be used to detect any mRNA
transcript in Drosophila larval imaginal discs or vertebrate
tissue sections. Labeled RNA probes can be prepared from
linearized doll cDNA (as shown in Figures 1, 2, 3), or a
fragment thereof, using the DIG RNA Labeling Kit (SP6/T7)
(Boehringer Mannheim) following the manufacturer's
recommendations.

Example VI: Expression of doll in Drosophila melanogaster

Doll can be expressed in Drosophila in the whole organism, in a
specific organ or in a specific cell type, during the whole life
or only at a specific developmental stage, and at different
levels. An overview of the standard methods used in Drosophila
genetics can be found in (Brand and Perrimon 1993; Perrimon
1998; Perrimon 1998).

Generation of doll mutant embryos 

Mosaic germlines are generated with the help of site-specific
recombination through the FLP recombinase (Xu and Rubin 1993).
Females of the genotype hsp70:flp, FRT82 doll^130 / FRT82
ubi-GFP are heat-shocked at 37°C for 1 hr during the third
instar larval stage to induce FLP-directed recombination, which
yields germline mosaics, and are later mated to doll^130 /
TM6b[y+] males. The source of recombinase is a first chromosome
insertion of a fusion of the hsp70 promoter (denoted by "hsp70")
to the FLP coding sequence. Somatic recombination at the FRT82
sites gives rise to an adult female germ line that produces
oocytes which upon fertilization lead to embryos that contain
neither zygotic nor maternally contributed information for the
production of functional dDoll protein. Those embryos can be
identified by the absence of the yellow+ phenotype provided by
the TM6b[y+] paternal balancer chromosome. For analysis,
cuticles are prepared by standard techniques from mutant
embryos and examined by dark field microscopy.

dAPC2 doll double-mutant germ line clones were generated with an
FRT82 dAPC2 DS doll^130 chromosome. The FRT82 ovo m chromosome
(Chou and Perrimon 1996) was used to select for mutant germ
cells. The FRT82 doll^130 chromosome was also used to create
doll mutant clones in discs, in conjunction with an FRT82
arm-lacZ chromosome.

Generation of doll mutant embryos expressing constitutively
active Arm

In order to express constitutively active Arm ("ΔArm"), females
of the genotype described above are heat-shocked at 37°C for 1
hr during late pupal stages and mated to males of the genotype
UAS:ΔArm hsp70-Gal4 / UAS:ΔArm hsp70-Gal4; doll / TM6b[y+]. Due
to the presence of the additional transgenes in these males,
offspring that arise from a doll mutant oocyte and a doll mutant
sperm express, upon heat treatment, the constitutively active
Arm protein, which transiently induces Wingless target genes.

Example VII: Rescue of ddoll-/- flies with hdoll-1 and hdoll-2 
cDNA expression 

In order to confirm the functional homology between Drosophila
Doll and human Doll-1 and human Doll-2, the human genes were
introduced into Drosophila flies carrying two mutant doll
alleles (ddoll-/- flies). Specifically, flies carrying e.g. a
tub:hdoll transgene and two mutant doll alleles, e.g. doll^130
and EP(3)1076 (publicly available), were generated. ddoll-/-
mutant flies display larval or pupal lethality. In contrast,
ddoll-/- mutant flies carrying at least one copy of the
tub:hdoll-1 or tub:hdoll-2 transgenes survive to adulthood. This
demonstrates that both hDoll-1 and hDoll-2 can replace
endogenous dDoll function in flies and thus validates functional
homology between Drosophila and human Doll (Figure 7).

Example VIII: Protein production and purification of Doll in
E. coli

The following method describes recombinant expression of Doll
proteins in bacterial cells. DNA encoding full-length or a
truncated Doll form is fused e.g. downstream of an epitope tag
or glutathione-S-transferase (GST) cDNA and a thrombin or
enterokinase cleavage site contained within an inducible
bacterial expression vector. Such epitope tags include poly-his,
S-protein, thioredoxin and immunoglobulin tags. A variety of
plasmids can be employed, including commercially available
plasmids such as pGEX-4T (Pharmacia) or pET-32a (Novagen).

Briefly, a bacterial expression plasmid containing the doll
sequence, for instance fused to a GST sequence, is transformed
by conventional methods into protease-deficient E. coli such as
BL21 (Stratagene). A bacterial colony containing the plasmid is
then expanded overnight in selection medium to reach saturation.
The next morning, this culture is diluted 1:100 and the bacteria
are allowed to grow to an optical density (OD600) of 0.6.
Protein production is initiated by addition of an inducer of the
promoter under which the GST-Doll fusion protein is expressed. A
variety of inducers can be employed depending on the expression
vector used, including IPTG.

Expressed GST-tagged Doll can then be purified, for instance,
using affinity beads or affinity chromatography, such as
glutathione beads (commercially available from Pharmacia).
Extracts are prepared by lysing the Doll-expressing bacteria in
sonication buffer (10 mM Tris HCl pH 8.0, 150 mM NaCl, 1 mM
EDTA, 1.5% sarkosyl, 2% Triton-X-100, 1 mM DTT and protease
inhibitors), followed by short sonication on ice (e.g. 3 times
20 seconds at middle power) and centrifugation. Cleared
supernatants are then incubated under gentle rotation, for
example with glutathione beads for 1 hr at 4°C. Next, beads are
washed several times in washing buffer (20 mM Tris pH 8.0, 200
mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 0.5% NP40), and
finally stored in storage buffer (20 mM Tris pH 8.0, 200 mM
NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 10% glycerol, 0.5% NP40,
and proteinase inhibitors).
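
Preparing the buffers listed above from concentrated stocks is a
direct application of C1 x V1 = C2 x V2. The Python sketch below
works this out for the washing buffer. The final concentrations
are those given in the text; the stock concentrations, the 100 ml
batch size and the function names are assumptions made for
illustration and are not specified in this document.

```python
def stock_volume_ml(final_conc: float, stock_conc: float,
                    final_volume_ml: float) -> float:
    """Volume of stock needed, from C1 * V1 = C2 * V2 (same units for C1, C2)."""
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and at least as concentrated")
    return final_conc * final_volume_ml / stock_conc


# Final concentrations are those of the washing buffer in the text.
# Stock concentrations are assumptions chosen for illustration only.
WASHING_BUFFER = {
    "Tris pH 8.0 (mM)": (20.0, 1000.0),
    "NaCl (mM)": (200.0, 5000.0),
    "EDTA (mM)": (1.0, 500.0),
    "DTT (mM)": (1.0, 1000.0),
    "MgCl2 (mM)": (1.0, 1000.0),
    "NP40 (%)": (0.5, 10.0),
}


def recipe(buffer: dict[str, tuple[float, float]],
           final_volume_ml: float) -> dict[str, float]:
    """Stock volumes in ml for each component, plus water to final volume."""
    volumes = {name: stock_volume_ml(final, stock, final_volume_ml)
               for name, (final, stock) in buffer.items()}
    volumes["water"] = final_volume_ml - sum(volumes.values())
    return volumes


wash_100ml = recipe(WASHING_BUFFER, 100.0)
for component, volume in wash_100ml.items():
    print(f"{component}: {volume:.2f} ml")
```

The same table-driven approach extends to the sonication and
storage buffers by adding their components and assumed stocks.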

Alternatively, a His-, S-protein-, thioredoxin- or IgG-tagged
Doll can be purified using affinity chromatography.

The quality of the preparations can be checked e.g. by SDS-gel 
electrophoresis and silver staining or Western blot. 

In case the epitope tag has to be cleaved, several methods are
available depending on the presence of a cleavage site between
the epitope tag and the Doll protein. For example, it is
possible to produce a GST-Doll fusion protein containing a
thrombin cleavage site right before the first Doll amino acid.
Briefly, a GST-Doll preparation on glutathione-affinity beads is
washed several times in cleavage buffer (50 mM Tris HCl pH 7.0,
150 mM NaCl, 1 mM EDTA, 1 mM DTT). Thrombin is then added and
the samples are incubated for over 16 h at room temperature.
Supernatants are then collected and analyzed for successful
cleavage of Doll from the beads by polyacrylamide gel
electrophoresis and silver staining or Western blot.



Example IX: Protein-protein interactions involving Doll 

A GST-fusion protein in vitro binding assay can be performed to
map binding domains and find additional interaction partners.
For this purpose, proteins are in vitro translated using
reticulocyte lysates (e.g. TNT lysates, Promega Corporation)
containing [35S]methionine following the instructions provided
by the manufacturer. Alternatively, cellular proteins can be
labeled by incubation of cultured cells with [35S]methionine.
Glutathione S-transferase (GST) fusion proteins, produced as
illustrated in Example VIII, are immobilized on
glutathione-Sepharose and blocked in binding buffer (20 mM Tris
pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 10%
glycerol, 0.5% NP40, 0.05% BSA, and proteinase inhibitors) for
45 min. Two µg of immobilized GST proteins are then incubated
for 1.5 hrs with 0.5-6 µl of in vitro translated proteins in
binding buffer or with [35S]methionine-labeled cell extract. The
beads are washed four times in washing buffer (20 mM Tris pH
8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 0.5% NP40)
and boiled in Laemmli SDS sample buffer. Proteins binding to
Doll are detected by autoradiography. If a cell lysate was used
to identify novel Doll binding partners, the protein bands on
the gel can be isolated by methods known in the art, and the
protein sequence can be determined e.g. by mass spectrometric
analysis.

A yeast two-hybrid assay can additionally be performed to
confirm the results of the in vitro binding assays described
above or to screen a cDNA library for new interaction partners
(Fields and Sternglanz 1994). In this context, the desired cDNAs
are subcloned into appropriate yeast expression vectors that
link them either to a LexA DNA binding domain (e.g. pLexA,
Clontech) or an acidic activation domain (e.g. pJG4-5,
Clontech). The appropriate pair of plasmids is then transformed
together with a reporter plasmid (e.g. pSH18-34, Clontech) into
an appropriate yeast strain (e.g. EGY48) by the lithium
acetate-polyethylene glycol method and grown on selective media
(Sambrook, Fritsch et al. 1989). Transformants are analyzed for
reporter gene activity as described by the manufacturer of the
vector-reporter plasmid used. To establish reproducibility, the
interaction is tested in both directions. Alternatively, this
method is used to screen for novel Doll interaction partners. In
this context, e.g. pLexA-Doll is transformed into yeast together
with a cDNA library cloned into e.g. pJG4-5 as described above.
Positive clones can be isolated and the cDNA they contain can be
sequenced by methods known to people skilled in the art.

Example X: Immunohistochemistry 

Localization of the Doll proteins is performed on Drosophila
embryos, imaginal discs, invertebrate and vertebrate adult
tissue sections or tumor cell lines using the anti-Doll
antibodies provided by this invention. For instance, if a tumor
cell line is used, cells can be seeded into polylysine-coated
8-well chambers (Nalge-Nunc Internat.) and grown overnight at
37°C. As a positive control, HEK 293 cells (ATCC) might be
transfected e.g. by a lipofection method (e.g. Lipofectamine,
Gibco technologies) with a Doll expression plasmid, such as
pcDNA3.1 (Invitrogen). Two days after transfection, cells are
washed and fixed with 3.7% formaldehyde in PBS for 10 min,
permeabilized in 0.5% Triton-X-100 for another 10 min, and
blocked with a 1:1000 dilution of pre-immunoserum in 2% BSA-PBS
for 1 h at RT. Cells are then incubated with a 1:1000 dilution
of anti-Doll immunoserum for 2 hrs at RT, followed by washing in
PBS and staining with anti-rabbit secondary antibody. The
washing step is repeated and preparations are blocked in a
solution of 3% BSA in PBS/0.1% Triton-X-100 for 1 hr. The slides
are then washed three times for 5 min in PBS and incubated with
a 1:200 dilution (v/v) of TRITC-conjugated swine anti-rabbit
immunoglobulin (Dako, Inc.). The washing step is repeated before
applying coverslips using Vectashield mounting medium (Vector
Laboratories, Inc.). Detection of other proteins such as
β-Catenin, hLgs or Tcf can be performed in the same way using
anti-β-Catenin (commercially available), anti-hLgs (US
09/915.543) or anti-Tcf (commercially available) specific
antibodies, respectively.



Example XI: Luciferase reporter gene assays

The effect of Doll on Tcf transactivation activity can be
assayed in a cell culture system using a Tcf-responsive
luciferase reporter gene. Depending on the expression vector
used, this protocol can be applied to mammalian as well as
Drosophila cell lines. For instance, HEK 293 cells (ATCC) are a
well-suited system. Doll full-length cDNA is cloned into a
mammalian expression vector, such as pcDNA3 (Invitrogen), and
transfected together with the TOPFLASH luciferase reporter
plasmid (Upstate Biotechnology, New York, USA) into 293 cells. A
lipofection agent like the Lipofectamine transfection reagent
(Life Technologies, Inc.) can be used for this purpose. A
Renilla luciferase reporter plasmid, e.g. pRL-SV40 (Promega
Corporation, Madison, USA), is co-transfected to normalize for
transfection efficiency. Cell extracts are prepared 48 h after
transfection and assayed for firefly and Renilla luciferase
activity as described by the manufacturer (Dual-Luciferase
Reporter Assay System, Promega Corporation). All the luciferase
values are normalized for Renilla luciferase activity (see
Figure 8).
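
The normalization step described above amounts to dividing each
firefly reading by the Renilla reading from the same well and
then expressing the result relative to a control. The Python
sketch below illustrates this. The well names and all readings
are hypothetical and are not data from Figure 8, and the choice
of an empty-vector control is an assumption.

```python
def normalized_activity(firefly: float, renilla: float) -> float:
    """Firefly signal corrected for transfection efficiency via Renilla."""
    if renilla <= 0:
        raise ValueError("Renilla reading must be positive")
    return firefly / renilla


def fold_over_control(wells: dict[str, tuple[float, float]],
                      control: str) -> dict[str, float]:
    """Normalized activity of every well relative to the named control well."""
    ratios = {name: normalized_activity(firefly, renilla)
              for name, (firefly, renilla) in wells.items()}
    return {name: ratio / ratios[control] for name, ratio in ratios.items()}


# Hypothetical (firefly, Renilla) luminometer readings in arbitrary units.
wells = {"empty_vector": (1000.0, 500.0), "doll": (6000.0, 600.0)}
folds = fold_over_control(wells, "empty_vector")
for name, fold in folds.items():
    print(f"{name}: {fold:.1f}-fold")
```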

Example XII: Screening of chemical compounds, organic products 
or peptides interfering with Doll function 

A reporter gene assay is performed with a protocol similar to
that described in Example XI, but scaled down to be performed as
a high throughput screen. For this purpose, colon cancer cell
lines with mutated and/or constitutively active β-Catenin are
stably transfected with the TOPFLASH vector described in Example
XI and Doll cDNA. The established monoclonal population that
gives the most reliable and constant reporter gene activity is
selected for later assays. One day after plating, cells are
treated with single compounds derived from a chemical or peptide
library. One to 24 hours later, reporter gene activity is
measured. Compounds found to inhibit reporter gene activity are
then further characterized for specific activity on the
Doll-containing transcriptional complex. Alternatively, Wnt
pathway activity can be measured by detecting mRNA or protein
levels of a target gene, e.g. myc (He, Sparks et al. 1998).
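
Selecting compounds that inhibit reporter gene activity, as
described above, requires a rule for calling hits. The Python
sketch below shows one common way to do this, using percent
inhibition relative to control wells together with the Z'-factor
as a plate quality measure. Neither the Z'-factor nor the 50%
threshold is specified in this document; they are assumptions
introduced here, and all names and readings are hypothetical.

```python
from statistics import mean, stdev


def percent_inhibition(signal: float, vehicle_mean: float,
                       background_mean: float) -> float:
    """0% equals the vehicle control, 100% equals the background control."""
    span = vehicle_mean - background_mean
    if span <= 0:
        raise ValueError("vehicle control must exceed background")
    return 100.0 * (vehicle_mean - signal) / span


def z_prime(vehicle: list[float], background: list[float]) -> float:
    """Z'-factor: separation of the two control distributions on a plate."""
    span = abs(mean(vehicle) - mean(background))
    if span == 0:
        raise ValueError("controls are not separated")
    return 1.0 - 3.0 * (stdev(vehicle) + stdev(background)) / span


def call_hits(compounds: dict[str, float], vehicle: list[float],
              background: list[float], threshold: float = 50.0) -> list[str]:
    """Names of compounds whose percent inhibition reaches the threshold."""
    v_mean, b_mean = mean(vehicle), mean(background)
    return sorted(name for name, signal in compounds.items()
                  if percent_inhibition(signal, v_mean, b_mean) >= threshold)


# Hypothetical normalized reporter readings for one plate.
vehicle_wells = [100.0, 102.0, 98.0]
background_wells = [10.0, 11.0, 9.0]
compound_wells = {"cmpd_A": 30.0, "cmpd_B": 90.0, "cmpd_C": 55.0}

plate_quality = z_prime(vehicle_wells, background_wells)
hits = call_hits(compound_wells, vehicle_wells, background_wells)
print(f"Z' = {plate_quality:.2f}, hits: {hits}")
```

Hits called this way would still need the follow-up
characterization described above, since a drop in reporter
signal can also reflect general toxicity.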

Example XIII: Screening assay based on protein-protein interac- 
tion for compounds inhibiting Doll-Lgs or Doll interaction part- 
ner X 

Doll and its interaction partner or fragments thereof are
produced and purified e.g. from E. coli cultures (e.g. as
described in Example VIII). Proteins are tagged e.g. with 6
histidines, S-protein, GST or thioredoxin. Small aliquots of the
purified proteins are incubated in an appropriate binding
buffer. At this point chemical compounds are added to the
mixture and their capacity to disrupt the protein-protein
interaction is monitored e.g. by any of the methods described
below. Compounds inhibiting this interaction are subsequently
tested for their specificity and in vivo toxicity.
Well-established methods to monitor protein-protein interactions
are e.g.:

• Time-resolved fluorometry with lanthanide chelate labels
(Hemmilä I. and Webb S. DDT 2: 373-381 (1997))

• Scintillation proximity assay (SPA) (Amersham Life Science)

Claims

1. A nucleic acid sequence coding for a polypeptide being
part of at least one signaling pathway in insects and
vertebrates, characterized in that said nucleotide
sequence is the "Daughter of Legless" (DOLL) gene as well
as homologues, fragments, derivatives and functional and
structural analogs thereof.

2. The nucleic acid sequence according to claim 1, charac- 
terized in that said signaling pathway is the Wnt sig- 
naling pathway. 

3. The nucleic acid sequence according to claim 2,
characterized in that it is the Drosophila melanogaster doll
gene (ddoll) comprising the nucleotide sequence as shown
in SEQ. ID. NO 1.

4. The nucleic acid sequence according to claim 3,
characterized in that it is coding for a polypeptide
comprising 50% to 100%, preferably 100% sequence identity to a
fragment or the entire sequence of the Drosophila Doll
polypeptide as shown in SEQ. ID. NO 6.

5. The nucleic acid sequence according to claim 2, charac- 
terized in that it is the human doll-1 (hdoll-1) com- 
prising the nucleotide sequence as shown in SEQ ID. NO 
2. 

6. The nucleic acid sequence according to claim 5,
characterized in that it is coding for a polypeptide
comprising 50% to 100%, preferably 100% sequence identity to a
fragment or the entire sequence of the human Doll
(hDoll-1) polypeptide as shown in SEQ. ID. NO 7.

7. The nucleic acid sequence according to claim 2,
characterized in that it is the human doll-2 gene (hdoll-2)
having the nucleotide sequence as shown in SEQ ID. NO 3.

8. The nucleic acid sequence according to claim 7,
characterized in that it is coding for a polypeptide
comprising 50% to 100%, preferably 100% sequence identity to a
fragment or the entire sequence of the human Doll
polypeptide (hDoll-2) as shown in SEQ. ID. NO 8.

9. The nucleic acid sequence according to claim 2,
characterized in that it is the mouse doll-1 gene (mdoll-1)
comprising the nucleotide sequence as shown in SEQ ID.
NO 4.

10. The nucleic acid sequence according to claim 9, charac- 
terized in that it is coding for a polypeptide compris- 
ing 50% to 100%, preferably 100% sequence identity to a 
fragment or the entire sequence of the mouse Doll 
(mDoll-1) polypeptide as shown in SEQ. ID. NO 9. 

11. The nucleic acid sequence according to claim 2,
characterized in that it is the mouse doll-2 gene (mdoll-2)
comprising the nucleotide sequence as shown in SEQ ID.
NO 5.

12. The nucleic acid sequence according to claim 11, charac- 
terized in that it is coding for a polypeptide compris- 
ing 50% to 100%, preferably 100% sequence identity to a 
fragment or the entire sequence of the mouse Doll poly- 
peptide (mDoll-2) as shown in SEQ. ID. NO 10. 

13. A polypeptide being part of at least one signaling
pathway in insects and vertebrates, characterized in that
said polypeptide is the "Daughter of Legless" (DOLL)
protein as well as homologues, fragments, derivatives
and structural and functional analogs thereof.

14. The polypeptide according to claim 13, characterized in
that said signaling pathway is the Wnt signaling
pathway.

15. The polypeptide according to claim 14, characterized in
that said polypeptide is the Drosophila melanogaster
Doll protein having the amino acid sequence as shown in
SEQ. ID. NO. 6.

16. The polypeptide according to claim 15, characterized in 
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence 
of the Doll polypeptide as shown in SEQ. ID. NO 6. 

17. The polypeptide according to claim 14, characterized in
that said polypeptide is the human Doll protein (hDOLL-1)
having the amino acid sequence as shown in SEQ. ID.
NO. 7.

18. The polypeptide according to claim 17, characterized in 
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence 
of the Doll polypeptide as shown in SEQ. ID. NO 7. 

19. The polypeptide according to claim 14, characterized in
that said polypeptide is the human Doll protein (hDoll-2)
having the amino acid sequence as shown in SEQ. ID.
NO. 8.



20. The polypeptide according to claim 19, characterized in
that said polypeptide has a 50% to 100%, preferably 100%
sequence identity to a fragment or the entire sequence
of the Doll polypeptide as shown in SEQ. ID. NO 8.

21. The polypeptide according to claim 14, characterized in
that said polypeptide is the mouse Doll-1 protein (mDoll-1)
having the amino acid sequence as shown in SEQ. ID.
NO. 9.

22. The polypeptide according to claim 21, characterized in
that said polypeptide has a 50% to 100%, preferably 100%
sequence identity to a fragment or the entire sequence of
the Doll polypeptide as shown in SEQ. ID. NO 9.

23. The polypeptide according to claim 14, characterized in
that said polypeptide is the mouse Doll-2 protein (mDoll-2)
having the amino acid sequence as shown in SEQ. ID.
NO. 10.

24. The polypeptide according to claim 23, characterized in 
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence of 
the Doll polypeptide as shown in SEQ. ID. NO 10. 

25. The nucleic acid sequence according to claim 2, charac- 
terized in that said nucleic acid sequence is coding for 
a polypeptide comprising a low overall amino acid se- 
quence identity and a sequence identity of 50% to 100%, 
preferably 100% in conserved domains. 

26. The nucleic acid sequence according to claim 25, charac- 
terized in that said conserved domain is the DHD domain. 

27. The nucleic acid sequence according to claim 26, charac- 
terized in that said conserved domain is the PHD finger. 

28. The nucleic acid sequence according to claim 2, charac- 
terized in that said sequence comprises a fragment of 20 
to 100 nucleotides in length, preferably from 20 to 60 
nucleotides, and most preferably from 20 to 50 nucleo- 
tides, said fragment being derived from SEQ. ID. NO. 1, 
2, 3, 4 or 5. 

29. The nucleic acid sequence according to claim 28, used as 
a hybridization probe. 



30. A vector comprising a nucleic acid molecule encoding a 
Drosophila or vertebrate doll gene or a fragment thereof, 
selected from the group consisting of SEQ. ID. NO. 1, 2, 
3, 4 or 5. 

31. The vector according to claim 30, selected from the group 
consisting of eucaryotic and procaryotic expression vec- 
tors. 



32. A host cell comprising the vector of claim 31, selected 
from the group consisting of mammalian cells, yeast 
cells, plant cells, insect cells or bacterial cells. 



33. A method for the preparation of Doll proteins, fragments, 
derivatives and analogs thereof, comprising the steps of: 

a) isolating a nucleic acid sequence containing the com- 
plete doll cDNA sequence or part thereof; 

b) recombinantly expressing doll cDNA or a fragment 
thereof in bacterial, mammalian, plant, yeast or in- 
sect cells; 

c) inducing protein production in said cells; 
d) purifying Doll proteins. 



34. The method according to claim 33, wherein step a) com- 
prises the isolation of a nucleic acid molecule encom- 
passing the doll cDNA from a vertebrate or invertebrate 
cDNA or genomic library. 
35. The method according to claim 33, wherein step b) comprises 
the fusion of DNA encoding full-length or truncated 
doll to an epitope tag and a cleavage site contained 
within an inducible eucaryotic or procaryotic expression 
vector and transforming the appropriate host 
cells with said expression vector. 

36. A chimeric protein comprising a Doll polypeptide fused to 
a heterologous amino acid sequence selected from the 
group consisting of an epitope-tagged sequence, an antibody, 
glutathione-S-transferase protein, β-galactosidase, 
and alkaline phosphatase. 



37. The chimeric protein according to claim 36, characterized 
in that the Doll polypeptide is selected from the group 
consisting of dDoll, hDoll-1, hDoll-2, mDoll-1 and mDoll-2. 

38. The polypeptide according to claim 14, characterized in 
that it comprises the full length Doll polypeptide or a 
fragment thereof comprising an antibody-binding site for 
an anti-Doll antibody. 

39. An antibody specifically recognizing a Doll polypeptide, 
said antibody being selected from the group consisting of 
polyclonal and monoclonal antibodies and fragments 
thereof. 

40. An assay for studying diseases induced by a disrupted Wnt 
function or for drug screening, comprising the use of 
organisms selected from the group consisting of Drosophila, 
mice, rats, rabbits, chickens, frogs, pigs or sheep, said 
organisms showing increased, reduced or no expression 
of doll, or expressing a mutated Doll polypeptide, in at least 
one tissue or organ. 
41. The assay according to claim 40, characterized in that 
said organisms express the doll gene as a heterologous 
transgene. 

41. The assay according to claim 40, characterized in that 
said doll gene comprises a mutation selected from the 
group consisting of deletions, point mutations, foreign 
DNA insertions and inversions. 

42. Use of a Doll protein, homologue, derivative and fragment 
thereof for the development of a therapeutic and diagnos- 
tic method for the treatment of conditions selected from 
the group consisting of disorders of cell fate, differen- 
tiation or proliferation. 

43. Use of a Doll protein, homologue, derivative and fragment 
thereof for the development of a therapeutic and diagnos- 
tic compound for the diagnosis of conditions selected 
from the group consisting of disorders of cell fate, dif- 
ferentiation or proliferation. 

44. The use of a Doll protein, homologue, derivative and fragment 
thereof according to claim 43, characterized in that 
said therapeutic or diagnostic compound is selected from 
the group consisting of Drosophila and vertebrate Doll 
protein homologues and fragments thereof and antibodies 
and antibody fragments thereof. 

45. Use of doll nucleic acids, homologues, derivatives and 
fragments thereof for the development of a therapeutic 
and/or diagnostic compound for the treatment of condi- 
tions selected from the group consisting of disorders of 
cell fate, differentiation or proliferation and its ap- 
plication to an individual. 
46. Use of doll nucleic acids, homologues, derivatives and 
fragments thereof for the development of a therapeutic 
and diagnostic method for the treatment of conditions selected 
from the group consisting of disorders of cell 
fate, differentiation or proliferation and its application 
to an individual. 

47. The use of doll nucleic acids, homologues, derivatives and 
fragments thereof according to claim 45, characterized in 
that said therapeutic or diagnostic compound is selected 
from the group consisting of doll antisense DNA or RNA, 
doll double-stranded RNA and chemical or naturally occurring 
compounds interfering with doll function. 

48. A peptide comprising a fragment of the Doll polypeptide. 

49. The peptide according to claim 48, comprising 40 to 60 
amino acids in the N-terminal region of hDoll-1, hDoll-2, 
mDoll-1, mDoll-2 and Drosophila Doll. 

50. The peptide according to claim 49, said peptide comprising 
the Doll homology domain (DHD). 

51. Use of DHD in screening methods for the identification of 
chemical compounds, organic products, polypeptides or peptides 
interfering with Doll function in the Wnt pathway. 

52. The use of DHD according to claim 51 in screening assay 
based on protein-protein interactions. 

53. The use of DHD according to claim 52, wherein said screening 
assay is selected from the group comprising an in vitro 
protein-protein interaction assay and a protein-protein 
interaction assay in a host cell. 
54. The use of DHD according to claim 51, in a screening assay 
for compounds specifically inhibiting the interaction be- 
tween Doll and another protein. 
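
The claims above recite a sequence identity of 50% to 100% to a fragment or to the entire Doll sequence, but they do not fix how identity is to be computed. As an illustration only, the following minimal Python sketch assumes a simple ungapped, position-by-position definition; the function names and the 50% threshold constant are hypothetical helpers and are not part of the disclosure. The two example fragments are taken from the hDoll-1 and mDoll-1 protein sequences shown in Figures 2 and 3.

```python
# Illustrative sketch only: ungapped percent identity between sequences.
# The definition of "identity" used here is an assumption; the claims do
# not prescribe an alignment method.

MIN_IDENTITY = 50.0  # lower bound recited in the claims (percent)


def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions in two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def best_ungapped_identity(fragment: str, reference: str) -> float:
    """Slide the fragment along the reference without gaps and return the
    highest percent identity found at any offset."""
    n = len(fragment)
    if n == 0 or n > len(reference):
        raise ValueError("fragment must be non-empty and fit in reference")
    return max(
        percent_identity(fragment, reference[i:i + n])
        for i in range(len(reference) - n + 1)
    )


# Residues 151-170 of hDoll-1 and mDoll-1 as printed in the figures.
human = "HDNSSFGNPSYNNALSQNVN"
mouse = "HDNSNFGNPPYNNVLTQDIN"
identity = percent_identity(human, mouse)
within_claimed_range = identity >= MIN_IDENTITY
```

A gapped alignment (for example a global alignment with a substitution matrix) would generally give different values; the ungapped form is used here only because it is unambiguous.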
Figure 1: The translated sequence of the Drosophila 
doll gene. 

dDoll cDNA Sequence Range: 1 to 2448 

ATGACCCACAATCTTGGTATGGCGCCATATCGATTGCCGGGTCCAGCGGG 
CGGACTCTGTCCGCCCGATTTTAAGCCGCCGCCTCCCACGGACATCATCT 
CGGCGCCGAGCAATCCGAAGAAGCGGCGAAAAACCTCAAGTGCCGCCAAC 
TCCGCTGCAGCGGTGGCTGCGGCGGCGGCTGCAGCAGCTGCTGCGAATTC 
CATGCAGCAGCAGCAGGCGCCACCCACACCGCAGGATTTGCTGCCCCCTC 
CGCCAATGGGAGGCTTCGGAGACACCATTATTGCCTCGAATCCATTCGAC 
GACAGTCCCCAGGTGTCGGCGATGTCCAGCTCAGCGGCCGCGGCGATGGC 
GGCCATGAATCAGATGGGCGGCGGACCAGGAGGTGGTCACTTTGGCGGCG 
GTGGACCGGGTGGGCACCCGCACTGGGAAGACCGCATGGGCATGGGCGGT 
GGACCTCCTCCCCCGCCTCACATGCATCCCCATATGCACCCGCATCATCC 
AGGCGGACCTATGGGTCACCCACATGGCCCACATCCGCACATGGGTGGTC 
CACCTCCAATGCGAGGAATGAGCCCCATGCACCCCCATCAAATGGGACCG 
GGACCAGGCGTCGGACTACCGCCGCATATGAATCACGGAAGGCCAGGGGG 
ACCTGGTGGTCCTGGAGGACCCGTCCCAATGGGTAGTCCCATGGGTGGAA 
TAGCTGGCATGGGCGGCATGAGCCCAATGGGCGGAATGGGAGGCCCCAGC 
ATATCACCCCATCACATGGGCATGGGTGGTCTGTCGCCCATGGGAGGCGG 
TCCCAACGGACCCAATCCGCGAGCCATGCAGGGTTCACCGATGGGCGGTC 
CGGGGCAGAACTCGCCAATGAACTCACTGCCTATGGGTTCGCCAATGGGC 
AATCCAATTGGCAGCCCGTTGGGCCCTCCCTCGGGACCGGGCCCTGGGAA 
TCCCGGCAATACCGGCGGACCACAGCAGCAACAACAACAACCTCCGCAGC 
CACCGATGAACAACGGGCAGATGGGTCCTCCTCCTCTGCACAGTCCGCTC 
GGAAACGGACCAACGGGTCATGGCAGTCACATGCCTGGAGGACCAATCCC 
AGGACCAGGTCCTGGGCCTGGCGGCCTAGTAGGTCCCGGTGGCATCTCCC 
CCGCGCACGGCAATAACCCGGGTGGTTCTGGGAACAACATGCTCGGCGGG 
AATCCCGGCGGCGGCAACAGCAACAACAACGGAAGCAATACAAGTAACGC 
CAGCAACAACAATCAAAATCCTCACCTCTCGCCAGCAGCCGGACGCCTGG 
GAGTGCCGACGTCGATGCAGTCGAATGGACCTTCGGTATCATCGGTAGCC 
TCCTCATCGGTTCCCTCGCCCGCCACGCCCACGCTCACGCCCACATCGAC 
GGCCACGTCCATGTCCACGTCAGTGCCTACATCCTCGCCAGCGCCGCCCG 
CCATGTCACCGCATCACTCGCTAAACAGCGCCGGCCCGAGTCCGGGCATG 
CCCAACTCGGGACCCAGCCCGCTGCAGTCACCAGCCGGACCCAATGGCCC 
CAATAACAACAACAGCAATAACAACAACGGACCCATGATGGGCCAGATGA 
TCCCGAACGCAGTTCCTATGCAGCACCAGCAGCACATGGGCGGCGGCCCA 
CCTGGCCACGGGCCCGGACCAATGCCCGGAATGGGCATGAACCAAATGCT 
GCCACCGCAGCAACCCTCCCATCTTGGTCCCCCGCATCCGAATATGATGA 
ACCACCCGCATCATCCGCACCATCATCCTGGCGGACCACCGCCGCACATG 
ATGGGTGGACCCGGAATGCACGGCGGTCCTGCTGGAATGCCTCCTCATAT 
GGGCGGAGGACCTAATCCGCACATGATGGGCGGTCCGCACGGGAACGCGG 
GTCCGCACATGGGCCACGGCCACATGGGTGGAGTACCAGGTCCAGGACCC 
GGACCCGGCGGCATGAACGGACCCCCGCATCCGCACATGTCCCCGCACCA 
CGGACATCCGCATCACCACCACAATCCGATGGGCGGCCCAGGTCCAAATA 
TGTTCGGCGGTGGTGGAGGAGGTCCCATGGGTCCCGGTGGACCGATGGGC 
AACATGGGGCCCATGGGAGGTGGCCCGATGGGCGGCCCTATGGGCGTAGG 
TCCCAAGCCGATGACAATGGGCGGCGGGAAGATGTACCCGCCGGGACAGC 
CAATGGTCTTTAATCCGCAGAACCCGAATGCGCCGCCCATATATCCTTGT 
GGCATGTGCCACAAGGAGGTGAACGACAACGACGAAGCCGTGTTCTGTGA 
ATCCGGTTGTAACTTTTTCTTTCACAGAACCTGTGTTGGCCTGACAGAGG 
CGGCCTTCCAAATGCTCAACAAGGAGGTGTTTGCCGAGTGGTGCTGCGAC 
AAGTGCGTGTCTTCCAAGCATATTCCCATGGTCAAGTTCAAGTGTTGA 



dDoll Protein sequence 1-815 

MTHNLGMAPYRLPGPAGGLCPPDFKPPPPTDIISAPSNPKKRRKTSSAAN 
SAAAVAAAAAAAAAANSMQQQQAPPTPQDLLPPPPMGGFGDTIIASNPFD 
DSPQVSAMSSSAAAAMAAMNQMGGGPGGGHFGGGGPGGHPHWEDRMGMGG 
GPPPPPHMHPHMHPHHPGGPMGHPHGPHPHMGGPPPMRGMSPMHPHQMGP 
GPGVGLPPHMNHGRPGGPGGPGGPVPMGSPMGGIAGMGGMSPMGGMGGPS 
ISPHHMGMGGLSPMGGGPNGPNPRAMQGSPMGGPGQNSPMNSLPMGSPMG 
NPIGSPLGPPSGPGPGNPGNTGGPQQQQQQPPQPPMNNGQMGPPPLHSPL 
GNGPTGHGSHMPGGPIPGPGPGPGGLVGPGGISPAHGNNPGGSGNNMLGG 
NPGGGNSNNNGSNTSNASNNNQNPHLSPAAGRLGVPTSMQSNGPSVSSVA 
SSSVPSPATPTLTPTSTATSMSTSVPTSSPAPPAMSPHHSLNSAGPSPGM 
PNSGPSPLQSPAGPNGPNNNNSNNNNGPMMGQMIPNAVPMQHQQHMGGGP 
PGHGPGPMPGMGMNQMLPPQQPSHLGPPHPNMMNHPHHPHHHPGGPPPHM 
MGGPGMHGGPAGMPPHMGGGPNPHMMGGPHGNAGPHMGHGHMGGVPGPGP 
GPGGMNGPPHPHMSPHHGHPHHHHNPMGGPGPNMFGGGGGGPMGPGGPMG 
NMGPMGGGPMGGPMGVGPKPMTMGGGKMYPPGQPMVFNPQNPNAPPIYPC 
GMCHKEVNDNDEAVFCESGCNFFFHRTCVGLTEAAFQMLNKEVFAEWCCD 
KCVSSKHIPMVKFKC 
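
The correspondence between the cDNA and the protein sequence of Figure 1 can be checked by conceptual translation. The short Python sketch below is an illustration under the assumption that the standard genetic code applies and that the reading frame starts at the first nucleotide; the function name `translate` is a hypothetical helper. The two test fragments are the first 30 and last 18 nucleotides of the dDoll cDNA as printed above.

```python
# Illustrative sketch: translate a cDNA fragment with the standard genetic
# code (assumed), reading from the first base until a stop codon.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, '*' = stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cdna: str) -> str:
    """Translate a DNA coding sequence; stop at the first stop codon."""
    seq = cdna.upper().replace(" ", "")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


n_terminus = translate("ATGACCCACAATCTTGGTATGGCGCCATAT")
c_terminus = translate("GTCAAGTTCAAGTGTTGA")
```

Applied to the full 2448-nucleotide sequence, the same function would be expected to return the 815-residue protein, provided the printed cDNA is free of transcription errors.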
Figure 2 

hDoll-1 cDNA sequence 

GGATCCCCACATGCCCGCCGAGAACTCTCCAGCTCCCGCTTACAAAGTTT 
CCTCGCATGGTGGTGATAGTGGACTGGATGGGTTAGGAGGACCAGGTGTA 
CAACTAGGAAGCCCAGATAAGAAAAAGCGCAAGGCAAATACACAGGGACC 
TTCTTTCCCTCCATTGTCTGAGTATGCTCCACCACCGAATCCAAACTCTG 
ACCATCTAGTGGCTGCTAATCCATTTGATGACAACTATAATACTATTTCC 
TATAAACCACTACCTTCGTCAAATCCATATCTTGGCCCTGGTTATCCTGG 
CTTTGGAGGCTATAGTACATTCAGAATGCCACCTCACGTTCCCCCAAGAA 
TGTCTTCCCCATACTGTGGTCCTTACTCACTCAGGAACCAGCCACACCCA 
TTTCCTCAGAATCCTCTGGGCATGGGTTTTAATCGACCTCATGCTTTTAA 
CTTTGGGCCACATGATAATTCAAGTTTCGGTAATCCATCTTATAATAATG 
CACTAAGTCAGAATGTCAACATGCCTAATCAACATTTTAGACAAAATCCT 
GCTGAAAATTTCAGTCAAATTCCTCCACAGAATGCTAGCCAAGTTTCTAA 
CCCCGATTTGGCATCTAATTTTGTTCCTGGAAATAATTCAAATTTTACTT 
CTCCGTTAGAATCTAATCATTCTTTTATTCCTCCCCCAAACACTTTTGGT 
CAAGCAAAAGCACCACCCCCAAAACAAGACTTTACTCAAGGAGCAACCAA 
AAACACTAATCAAAATTCCTCTGCTCATCCACCTCACTTGAATATGGATG 
ACACAGTGAATCAGAGTAATATTGAATTAAAAAATGTTAATCGAAACAAT 
GCAGTAAATCAGGAGAACAGCCGTTCAAGTAGCACTGAAGCCACAAACAA 
TAACCCTGCAAATGGGACGCAGAATAAGCCACGACAACCAAGAGGTGCAG 
CAGATGCCTGCACCACAGAAAAAAGCAATAAATCCTCTCTTCACCCAAAC 
CGTCATGGCCATTCGTCTTCTGACCCAGTGTATCCTTGTGGAATTTGTAC 
AAACGAGGTGAACGATGATCAGGATGCCATCTTATGTGAGGCCTCTTGTC 
AGAAATGGTTTCATCGGATCTGTACTGGAATGACTGAAACAGCTTATGGC 
CTCTTAACTGCAGAAGCATCTGCAGTATGGGGCTGTGATACCTGTATGGC 
TGACAAAGATGTCCAGTTAATGCGTACTAGAGAAACTTTTGGTCCATCTG 
CAGTGGGCAGTGATGCTTAATCAAAGGCATTAACTAAAGTGGGTTTATTT 
TCCTGTGCATTGCAGAAGTTCATTGACACAGGATTTTAATGTTTTACATT 
ATTTTTTTAAATGCAT 

hDoll-1 Protein sequence 

MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFP 
PLSEYAPPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGG 
YSTFRMPPHVPPRMSSPYCGPYSLRNQPHPFPQNPLGMGFNRPHAFNFGP 
HDNSSFGNPSYNNALSQNVNMPNQHFRQNPAENFSQIPPQNASQVSNPDL 
ASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQDFTQGATKNTN 
QNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA 
NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEV 
NDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKD 
VQLMRTRETFGPSAVGSDA 
hDoll-2 cDNA sequence 

CCCGGGTCCCCCACTCCATGGCCGCCTCGGCGCCGCCCCCACCGGACAAG 
CTGGAGGGAGGTGGCGGCCCCGCACCGCCCCCTGCGCCGCCCAGCACCGG 
GAGGAAGCAGGGCAAGGCCGGTCTGCAAATGAAGAGTCCAGAAAAGAAGC 
GAAGGAAGTCAAATACTCAGGGCCCTGCATACTCACATCTGACGGAGTTT 
GCACCACCCCCAACTCCCATGGTGGATCACCTGGTTGCATCCAACCCTTT 
TGAAGATGACTTCGGAGCCCCCAAAGTGGGGGTTGCAGCCCCTCCATTCC 
TTGGCAGTCCTGTGCCCTTCGGAGGCTTCCGTGTGCAGGGGGGCATGGCG 
GGCCAGGTACCCCCAGGCTACAGCACTGGAGGTGGAGGGGGCCCCCAGCC 
ACTCCGTCGACAGCCACCCCCCTTCCCTCCCAATCCTATGGGCCCTGCTT 
TCAACATGCCCCCCCAGGGTCCTGGCTACCCACCCCCAGGCAACATGAAC 
TTTCCCAGCCAACCCTTCAACCAGCCTCTGGGTCAAAACTTTAGTCCTCC 
CAGTGGGCAGATGATGCCGGGCCCAGTGGGGGGATTTGGTCCCATGATCT 
CACCCACCATGGGACAGCCTCCCAGAGCAGAGCTGGGCCCACCTTCTCTG 
TCCCAACGATTTGCTCAGCCAGGGGCTCCTTTTGGCCCTTCTCCTCTCCA 
GAGACCTGGTCAGGGGCTCCCCAGCCTGCCGCCTAACACAAGTCCCTTTC 
CTGGTCCGGACCCTGGCTTTCCTGGCCCTGGTGGTGAGGATGGGGGGAAG 
CCCTTGAATCCACCTGCTTCTACTGCTTTTCCCCAGGAGCCCCACTCAGG 
CTCCCCGGCTGCTGCTGTTAATGGGAACCAGCCCAGTTTCCCCCCGAACA 
GCAGTGGGCGGGGTGGGGGCACTCCAGATGCCAACAGCTTGGCACCCCCT 
GGCAAGGCAGGTGGGGGCTCCGGGCCCCAGCCTCCCCCAGGCTTGGTGTA 
CCCATGTGGTGCCTGTCGGAGTGAGGTGAACGATGACCAGGATGCCATTC 
TGTGTGAGGCCTCCTGCCAGAAATGGTTCCACCGTGAGTGCACAGGCATG 
ACTGAGAGCGCCTATGGGCTGCTGACCACTGAAGCTTCTGCCGTCTGGGC 
CTGCGATCTCTGCCTCAAGACCAAGGAGATCCAGTCTGTCTACATCCGTG 
AGGGCATGGGGCAGCTGGTGGCTGCTAACGATGGGTGACGCTGGTGAAGT 
GGCCCAGGGAAGTGCACATGTCTCTCCCTGCTCTTCCAGGGTGATTTTTT 
TGATGTTTGGCTCTTGGTCCTTGTTTCCACTGGCTTTCCATCCCCATGGG 
GCAGAAACAGTGGCTCCTGGGAGCAGAAAAGGAATTGAGGTGGGCAGGCA 
GAAGAGCCTGGATTGCTCACTGTTTTGGGAAACTTACATGTTGAGATCT 



hDoll-2 protein sequence 

MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNT 
QGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGSPVP 
FGGFRVQGGMAGQVPPGYSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQ 
GPGYPPPGNMNFPSQPFNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMGQ 
PPRAELGPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPG 
FPGPGGEDGGKPLNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGG 
GTPDANSLAPPGKAGGGSGPQPPPGLVYPCGACRSEVNDDQDAILCEASC 
QKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQL 
VAANDG 
Figure 3 

mDoll-1 cDNA sequence 

ATGTCAGCGGAACAGGACAAGGAGCCCATCGCGCTGAAGAGAGTTAGAGG 
TGGTGACAGTGGACTGGATGGGTTAGGAGGGCCCAATATACAACTCGGAA 
GCCCAGATAAGAAAAAACGCAAGGCCAACACACAGGGATCTTCCTTTCCT 
CCGCTGTCAGAGTACGCGCCACCACCGAATCCAAACTCCGACCATCTAGT 
GGCTGCCAATCCGTTCGATGACAGCTACAACACCATTTCCTATAAACCAC 
TGCCTTCATCTAATCCATATCTTGGCCCTGGGTATCCTGGCTTTGGAGGC 
TACAGCACATTCAGAATGCCACCCCACGTCCCTCCAAGAATGTCTTCTCC 
CTACTGTGGTCCTTACTCACTCAGGAATCAGCCCCACCCATTTCCTCAGA 
ATCCGTTGGGCATGGGCTTTAACCGGCCTCATGCTTTTAACTTTGGGCCG 
CATGATAATTCGAATTTTGGAAACCCACCTTATAATAATGTACTGACTCA 
GGACATTAACATGCCCGGTCAGCATTTTAGACAAGGCTCTGCTGAAAACT 
TCAGTCAGATTCCCCCGCAGAATGTTGGCCAAGTGTCTAACCCTGACCTC 
GCATCTAATTTTGCCCCTGGGAATAATTCAAATTTTACCTCTCCGTTAGA 
AACGAATCATTCGTTTATTCCACCCCCAAACGCGTTTGGCCAAGCAAAAG 
CTCCACTTCCCAAACAAGACTTCACTCAAGGGGCAACCAAAACCCCGAAT 
CAGAATTCGTCCACTCACCCACCTCACCTAAATATGGAGGATCCAGTCAA 
TCAGAGTAACGTCGAGTTAAAAAATGTCAACAGAAACAACGTTGTCCAAG 
AGAACAGCCGTTCGGGCAGCGCAGAGGCCACCAACAACCATGCGAATGGG 
ACCCAGAACAAGCCCCGGCAGCCCAGGGGCGCAGCTGACCTGTGCACCCC 
CGACAAAAGCCGCAAGTTCTCCCTGCTCCCCAGCCGGCATGGCCATTCCT 
CCTCTGACCCTGTGTACCCGTGCGGGATTTGTACAAATGAAGTGAATGAC 
GATCAGGACGCCATTCTGTGTGAAGCCTCTTGTCAGAAGTGGTTTCATCG 
CATCTGCACTGGAATGACCGAAACAGCCTACGGGCTCCTGACAGCGGAAG 
CATCCGCAGTGTGGGGCTGTGACACGTGCATGGCTGACAAGGATGTCCAG 
CTCATGCGCACTAGAGAGGCCTTTGGTCCACCTGCCGTGGGCGGCGATGC 
CTAA 



mDoll-1 protein sequence 

MSAEQDKEPIALKRVRGGDSGLDGLGGPNIQLGSPDKKKRKANTQGSSFP 
PLSEYAPPPNPNSDHLVAANPFDDSYNTISYKPLPSSNPYLGPGYPGFGG 
YSTFRMPPHVPPRMSSPYCGPYSLRNQPHPFPQNPLGMGFNRPHAFNFGP 
HDNSNFGNPPYNNVLTQDINMPGQHFRQGSAENFSQIPPQNVGQVSNPDL 
ASNFAPGNNSNFTSPLETNHSFIPPPNAFGQAKAPLPKQDFTQGATKTPN 
QNSSTHPPHLNMEDPVNQSNVELKNVNRNNVVQENSRSGSAEATNNHANG 
TQNKPRQPRGAADLCTPDKSRKFSLLPSRHGHSSSDPVYPCGICTNEVND 
DQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQ 
LMRTREAFGPPAVGGDA 
Figure 3 



mDoll-2 cDNA sequence 

ATGGCCGCCTCGGCGCCGCCCCCACCGGACAAGCTGGAGGGAGGCAGCGG 
CCCCGCACCGCCCCCCGCGCCGCCCAGCACCGGGAGGAAGCAGGGCAAGG 
CCGGTCTGCAAATGAAGAGCCCTGAAAAGAAGCGAAGAAAGTCCAATACT 
CAGGGTCCTGCATATTCACATCTGACGGAGTTCGCCCCACCCCCGACCCC 
CATGGTGGATCATCTGGTCGCTTCTAACCCTTTTGAGGATGATTTCGGAG 
CCCCTAAAGTGGGGGGCGCAGGCCCTCCGTTCCTCGGCAGTCCGGTGCCC 
TTTGGAGGCTTTCGTGTACAGGGGGGCATGGCAGGCCAGGTACCCCCAAG 
CTACGGCACTGGAGGAGGAGGGGGTCCCCAGCCACTTCGTCGGCAGCCCC 
CTCCTTTTCCCCCCAGCCCTATGGGTCCAGCTTTTAATATGCCCCCTCAG 
GGTCCCTGGGGTACCCCGCCCCCTGGCAACATGAACTTTCCCAGTCAACC 
CTTCAACCAGTCTCTGGGCCAAAACTTTAGCCCACCTGGTGGGCAGGTGA 
TGCCAGGCCCAGTAGGCGGATTTGGTCCCATGATCTCACCGACCATGGGA 
CAGCCTCCTAGAGGGGAGCTGGGTCCTCCTCCTCTCCCCCAACGCTTTAC 
CCAACCAGGAGCACCTTATGGTCCTTCTCTTCAGAGACCTGGTCAGGGAC 
TCACCCAGCTGCCCTCCAACACAAGTCCCTTCCCTGGTCCAGACCCTGGT 
TTTCCTGGACCTGGCGGTGAGGATGGTGGGAAGCCCTTGAACCCACCGGC 
TCCCACCGCCTTTCCCCAGGAAGCACCATTCGGGCTCCCCGCTGCTGCTG 
TCAATGGGAATCAGCCCAGTTTCCCCCCTAGCAGCAGTGGTCGAGGTGGG 
GGCACTCCAGATGCCAACAGTCTGGCACCCCCCGGCAAGGCAGGGGGAGG 
CTCAGGGCCCCAGCCTCCCCCAGGCCTGGTGTACCCCTGCGGTGCCTGCC 
GTAGTGAGGTAAATGATGACCAGGATGCCATTCTGTGTGAGGCCTCCTGC 
CAGAAGTGGTTTCACCGCGAGTGCACCGGCATGACCGAGAGTGCCTACGG 
CCTGCTGACCACCGAGGCCTCTGCCGTCTGGGCCTGTGATCTTTGCCTCA 
AGACCAAGGAGATCCAGTCTGTCTACATCCGAGAGGGCATGGGCCAGTTG 
GTGGCTGCTAACGATGGGTGA 



mDoll-2 protein sequence 

MAASAPPPPDKLEGGSGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNT 
QGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAGPPFLGSPVP 
FGGFRVQGGMAGQVPPSYGTGGGGGPQPLRRQPPPFPPSPMGPAFNMPPQ 
GPWGTPPPGNMNFPSQPFNQSLGQNFSPPGGQVMPGPVGGFGPMISPTMG 
QPPRGELGPPPLPQRFTQPGAPYGPSLQRPGQGLTQLPSNTSPFPGPDPG 
FPGPGGEDGGKPLNPPAPTAFPQEAPFGLPAAAVNGNQPSFPPSSSGRGG 
GTPDANSLAPPGKAGGGSGPQPPPGLVYPCGACRSEVNDDQDAILCEASC 
QKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQL 
VAANDG 
dDoll 815 aa (DHD and PHD domains indicated) 

hDoll-1 419 aa 

hDoll-2 406 aa 
SEQUENCE LISTING PART 
<110> The Genetics Company 

<120> A new essential downstream component of the wingless signalling pathway 
<130> P133559 
<160> 10 

<170> PatentIn version 3.1 

<210> 1 

<211> 2448 

<212> DNA 

<213> Drosophila melanogaster 

<400> 1 



atgacccaca atcttggtat ggcgccatat cgattgccgg gtccagcggg cggactctgt 60 

ccgcccgatt ttaagccgcc gcctcccacg gacatcatct cggcgccgag caatccgaag 120 

aagcggcgaa aaacctcaag tgccgccaac tccgctgcag cggtggctgc ggcggcggct 180 

gcagcagctg ctgcgaattc catgcagcag cagcaggcgc cacccacacc gcaggatttg 240 

ctgccccctc cgccaatggg aggcttcgga gacaccatta ttgcctcgaa tccattcgac 300 

gacagtcccc aggtgtcggc gatgtccagc tcagcggccg cggcgatggc ggccatgaat 360 

cagatgggcg gcggaccagg aggtggtcac tttggcggcg gtggaccggg tgggcacccg 420 

cactgggaag accgcatggg catgggcggt ggacctcctc ccccgcctca catgcatccc 480 

catatgcacc cgcaccaccc aggcggacct atgggtcacc cacatggccc acatccgcac 540 

atgggtggtc cacctccaat gcgaggaatg agccccatgc acccccatca aatgggaccg 600 

ggaccaggcg tcggactacc gccgcatatg aatcacggaa ggccaggggg acctggtggt 660 

cctggaggac ccgtcccaat gggtagtccc atgggtggaa tagctggcat gggcggcatg 720 

agcccaatgg gcggaatggg aggccccagc atatcacccc atcacatggg catgggtggt 780 

ctgtcgccca tgggaggcgg tcccaacgga cccaatccgc gagccatgca gggttcaccg 840 

atgggcggtc cggggcagaa ctcgccaatg aactcactgc ctatgggttc gccaatgggc 900 

aatccaattg gcagcccgtt gggccctccc tcgggaccgg gccctgggaa tcccggcaat 960 

accggcggac cacagcagca acaacaacaa cctccgcagc caccgatgaa caacgggcag 1020 

atgggtcctc ctcctctgca cagtccgctc ggaaacggac caacgggtca tggcagtcac 1080 

atgcctggag gaccaatccc aggaccaggt cctgggcctg gcggcctagt aggtcccggt 1140 

ggcatctccc ccgcgcacgg caataacccg ggtggttctg ggaacaacat gctcggcggg 1200 

aatcccggcg gcggcaacag caacaacaac ggaagcaata caagtaacgc cagcaacaac 1260 

aatcaaaatc ctcacctctc gccagcagcc ggacgcctgg gagtgccgac gtcgatgcag 1320 

tcgaatggac cttcggtatc atcggtagcc tcctcatcgg ttccctcgcc cgccacgccc 1380 

acgctcacgc ccacatcgac ggccacgtcc atgtccacgt cagtgcctac atcctcgcca 1440 

gcgccgcccg ccatgtcacc gcatcactcg ctaaacagcg ccggcccgag tccgggcatg 1500 

cccaactcgg gacccagccc gctgcagtca ccagccggac ccaatggccc caataacaac 1560 

aacagcaata acaacaacgg acccatgatg ggccagatga tcccgaacgc agttcctatg 1620 

cagcaccagc agcacatggg cggcggccca cctggccacg ggcccggacc aatgcccgga 1680 

atgggcatga accaaatgct gccaccgcag caaccctccc atcttggtcc cccgcatccg 1740 

aatatgatga accacccgca tcatccgcac catcatcctg gcggaccacc gccgcacatg 1800 

atgggtggac ccggaatgca cggcggtcct gctggaatgc ctcctcatat gggcggagga 1860 

cctaatccgc acatgatggg cggtccgcac gggaacgcgg gtccgcacat gggccacggc 1920 

cacatgggtg gagtaccagg tccaggaccc ggacccggcg gcatgaacgg acccccgcat 1980 

ccgcacatgt ccccgcacca cggacatccg catcaccacc acaatccgat gggcggccca 2040 

ggtccaaata tgttcggcgg tggtggagga ggtcccatgg gtcccggtgg accgatgggc 2100 

aacatggggc ccatgggagg tggcccgatg ggcggcccta tgggcgtagg tcccaagccg 2160 

atgacaatgg gcggcgggaa gatgtacccg ccgggacagc caatggtctt taatccgcag 2220 

aacccgaatg cgccgcccat atatccttgt ggcatgtgcc acaaggaggt gaacgacaac 2280 

gacgaagccg tgttctgtga atccggttgt aactttttct ttcacagaac ctgtgttggc 2340 

ctgacagagg cggccttcca aatgctcaac aaggaggtgt ttgccgagtg gtgctgcgac 2400 

aagtgcgtgt cttccaagca tattcccatg gtcaagttca agtgttga 2448 


<210> 2 
<211> 1366 
<212> DNA 
<213> Human 










<400> 2 

ggatccccac atgcccgccg agaactctcc agctcccgct tacaaagttt cctcgcatgg 60 

tggtgatagt ggactggatg ggttaggagg accaggtgta caactaggaa gcccagataa 120 

gaaaaagcgc aaggcaaata cacagggacc ttctttccct ccattgtctg agtatgctcc 180 

accaccgaat ccaaactctg accatctagt ggctgctaat ccatttgatg acaactataa 240 

tactatttcc tataaaccac taccttcgtc aaatccatat cttggccctg gttatcctgg 300 

ctttggaggc tatagtacat tcagaatgcc acctcacgtt cccccaagaa tgtcttcccc 360 

atactgtggt ccttactcac tcaggaacca gccacaccca tttcctcaga atcctctggg 420 

catgggtttt aatcgacctc atgcttttaa ctttgggcca catgataatt caagtttcgg 480 

taatccatct tataataatg cactaagtca gaatgtcaac atgcctaatc aacattttag 540 

acaaaatcct gctgaaaatt tcagtcaaat tcctccacag aatgctagcc aagtttctaa 600 

ccccgatttg gcatctaatt ttgttcctgg aaataattca aattttactt ctccgttaga 660 

atctaatcat tcttttattc ctcccccaaa cacttttggt caagcaaaag caccaccccc 720 

aaaacaagac tttactcaag gagcaaccaa aaacactaat caaaattcct ctgctcatcc 780 

acctcacttg aatatggatg acacagtgaa tcagagtaat attgaattaa aaaatgttaa 840 

tcgaaacaat gcagtaaatc aggagaacag ccgttcaagt agcactgaag ccacaaacaa 900 

taaccctgca aatgggacgc agaataagcc acgacaacca agaggtgcag cagatgcctg 960 

caccacagaa aaaagcaata aatcctctct tcacccaaac cgtcatggcc attcgtcttc 1020 

tgacccagtg tatccttgtg gaatttgtac aaacgaggtg aacgatgatc aggatgccat 1080 

cttatgtgag gcctcttgtc agaaatggtt tcatcggatc tgtactggaa tgactgaaac 1140 

agcttatggc ctcttaactg cagaagcatc tgcagtatgg ggctgtgata cctgtatggc 1200 

tgacaaagat gtccagttaa tgcgtactag agaaactttt ggtccatctg cagtgggcag 1260 

tgatgcttaa tcaaaggcat taactaaagt gggtttattt tcctgtgcat tgcagaagtt 1320 

cattgacaca ggattttaat gttttacatt atttttttaa atgcat 1366 

<210> 3 
<211> 1449 
<212> DNA 
<213> Human 

<400> 3 

cccgggtccc ccactccatg gccgcctcgg cgccgccccc accggacaag ctggagggag 60 

gtggcggccc cgcaccgccc cctgcgccgc ccagcaccgg gaggaagcag ggcaaggccg 120 

gtctgcaaat gaagagtcca gaaaagaagc gaaggaagtc aaatactcag ggccctgcat 180 

actcacatct gacggagttt gcaccacccc caactcccat ggtggatcac ctggttgcat 240 

ccaacccttt tgaagatgac ttcggagccc ccaaagtggg ggttgcagcc cctccattcc 300 

ttggcagtcc tgtgcccttc ggaggcttcc gtgtgcaggg gggcatggcg ggccaggtac 360 

ccccaggcta cagcactgga ggtggagggg gcccccagcc actccgtcga cagccacccc 420 

ccttccctcc caatcctatg ggccctgctt tcaacatgcc cccccagggt cctggctacc 480 

cacccccagg caacatgaac tttcccagcc aacccttcaa ccagcctctg ggtcaaaact 540 

ttagtcctcc cagtgggcag atgatgccgg gcccagtggg gggatttggt cccatgatct 600 

cacccaccat gggacagcct cccagagcag agctgggccc accttctctg tcccaacgat 660 

ttgctcagcc aggggctcct tttggccctt ctcctctcca gagacctggt caggggctcc 720 

ccagcctgcc gcctaacaca agtccctttc ctggtccgga ccctggcttt cctggccctg 780 

gtggtgagga tggggggaag cccttgaatc cacctgcttc tactgctttt ccccaggagc 840 

cccactcagg ctccccggct gctgctgtta atgggaacca gcccagtttc cccccgaaca 900 
gcagtgggcg gggtgggggc actccagatg ccaacagctt ggcaccccct ggcaaggcag 960 

gtgggggctc cgggccccag cctcccccag gcttggtgta cccatgtggt gcctgtcgga 1020 

gtgaggtgaa cgatgaccag gatgccattc tgtgtgaggc ctcctgccag aaatggttcc 1080 

accgtgagtg cacaggcatg actgagagcg cctatgggct gctgaccact gaagcttctg 1140 

ccgtctgggc ctgcgatctc tgcctcaaga ccaaggagat ccagtctgtc tacatccgtg 1200 

agggcatggg gcagctggtg gctgctaacg atgggtgacg ctggtgaagt ggcccaggga 1260 

agtgcacatg tctctccctg ctcttccagg gtgatttttt tgatgtttgg ctcttggtcc 1320 

ttgtttccac tggctttcca tccccatggg gcagaaacag tggctcctgg gagcagaaaa 1380 

ggaattgagg tgggcaggca gaagagcctg gattgctcac tgttttggga aacttacatg 1440 

ttgagatct 1449 


<210> 4 
<211> 1254 
<212> DNA 
<213> Mouse 










<400> 4 
atgtcagcgg aacaggacaa ggagcccatc gcgctgaaga gagttagagg tggtgacagt 60 

ggactggatg ggttaggagg gcccaatata caactcggaa gcccagataa gaaaaaacgc 120 

aaggccaaca cacagggatc ttcctttcct ccgctgtcag agtacgcgcc accaccgaat 180 

ccaaactccg accatctagt ggctgccaat ccgttcgatg acagctacaa caccatttcc 240 

tataaaccac tgccttcatc taatccatat cttggccctg ggtatcctgg ctttggaggc 300 

tacagcacat tcagaatgcc accccacgtc cctccaagaa tgtcttctcc ctactgtggt 360 

ccttactcac tcaggaatca gccccaccca tttcctcaga atccgttggg catgggcttt 420 

aaccggcctc atgcttttaa ctttgggccg catgataatt cgaattttgg aaacccacct 480 

tataataatg tactgactca ggacattaac atgcccggtc agcattttag acaaggctct 540 

gctgaaaact tcagtcagat tcccccgcag aatgttggcc aagtgtctaa ccctgacctc 600 

gcatctaatt ttgcccctgg gaataattca aattttacct ctccgttaga aacgaatcat 660 

tcgtttattc cacccccaaa cgcgtttggc caagcaaaag ctccacttcc caaacaagac 720 

ttcactcaag gggcaaccaa aaccccgaat cagaattcgt ccactcaccc acctcaccta 780 

aatatggagg atccagtcaa tcagagtaac gtcgagttaa aaaatgtcaa cagaaacaac 840 

gttgtccaag agaacagccg ttcgggcagc gcagaggcca ccaacaacca tgcgaatggg 900 

acccagaaca agccccggca gcccaggggc gcagctgacc tgtgcacccc cgacaaaagc 960 

cgcaagttct ccctgctccc cagccggcat ggccattcct cctctgaccc tgtgtacccg 1020 

tgcgggattt gtacaaatga agtgaatgac gatcaggacg ccattctgtg tgaagcctct 1080 

tgtcagaagt ggtttcatcg catctgcact ggaatgaccg aaacagccta cgggctcctg 1140 

acagcggaag catccgcagt gtggggctgt gacacgtgca tggctgacaa ggatgtccag 1200 

ctcatgcgca ctagagaggc ctttggtcca cctgccgtgg gcggcgatgc ctaa 1254 


<210> 5 
<211> 1221 
<212> DNA 
<213> Mouse 








<400> 5 
atggccgcct cggcgccgcc cccaccggac aagctggagg gaggcagcgg ccccgcaccg 60 

ccccccgcgc cgcccagcac cgggaggaag cagggcaagg ccggtctgca aatgaagagc 120 

cctgaaaaga agcgaagaaa gtccaatact cagggtcctg catattcaca tctgacggag 180 

ttcgccccac ccccgacccc catggtggat catctggtcg cttctaaccc ttttgaggat 240 

gatttcggag cccctaaagt ggggggcgca ggccctccgt tcctcggcag tccggtgccc 300 

tttggaggct ttcgtgtaca ggggggcatg gcaggccagg tacccccaag ctacggcact 360 

ggaggaggag ggggtcccca gccacttcgt cggcagcccc ctccttttcc ccccagccct 420 

atgggtccag cttttaatat gccccctcag ggtccctggg gtaccccgcc ccctggcaac 480 

atgaactttc ccagtcaacc cttcaaccag tctctgggcc aaaactttag cccacctggt 540 

gggcaggtga tgccaggccc agtaggcgga tttggtccca tgatctcacc gaccatggga 600 

cagcctccta gaggggagct gggtcctcct cctctccccc aacgctttac ccaaccagga 660 

gcaccttatg gtccttctct tcagagacct ggtcagggac tcacccagct gccctccaac 720 

acaagtccct tccctggtcc agaccctggt tttcctggac ctggcggtga ggatggtggg 780 

aagcccttga acccaccggc tcccaccgcc tttccccagg aagcaccatt cgggctcccc 840 

gctgctgctg tcaatgggaa tcagcccagt ttccccccta gcagcagtgg tcgaggtggg 900 

ggcactccag atgccaacag tctggcaccc cccggcaagg cagggggagg ctcagggccc 960 

cagcctcccc caggcctggt gtacccctgc ggtgcctgcc gtagtgaggt aaatgatgac 1020 

caggatgcca ttctgtgtga ggcctcctgc cagaagtggt ttcaccgcga gtgcaccggc 1080 

atgaccgaga gtgcctacgg cctgctgacc accgaggcct ctgccgtctg ggcctgtgat 1140 

ctttgcctca agaccaagga gatccagtct gtctacatcc gagagggcat gggccagttg 1200 

gtggctgcta acgatgggtg a 1221 



<210> 6 

<211> 815 

<212> PRT 

<213> Drosophila melanogaster 



<400> 6 
Met Thr His Asn Leu Gly Met Ala Pro Tyr Arg Leu Pro Gly Pro Ala 
1 5 10 15 



Gly Gly Leu Cys Pro Pro Asp Phe Lys Pro Pro Pro Pro Thr Asp Ile 
20 25 30 



Ile Ser Ala Pro Ser Asn Pro Lys Lys Arg Arg Lys Thr Ser Ser Ala 
35 40 45 



Ala Asn Ser Ala Ala Ala Val Ala Ala Ala Ala Ala Ala Ala Ala Ala 
50 55 60 



Ala Asn Ser Met Gln Gln Gln Gln Ala Pro Pro Thr Pro Gln Asp Leu 
65 70 75 80 



Leu Pro Pro Pro Pro Met Gly Gly Phe Gly Asp Thr Ile Ile Ala Ser 
85 90 95 



Asn Pro Phe Asp Asp Ser Pro Gln Val Ser Ala Met Ser Ser Ser Ala 
100 105 110 



Ala Ala Ala Met Ala Ala Met Asn Gln Met Gly Gly Gly Pro Gly Gly 
115 120 125 



Gly His Phe Gly Gly Gly Gly Pro Gly Gly His Pro His Trp Glu Asp 
130 135 140 



Arg Met Gly Met Gly Gly Gly Pro Pro Pro Pro Pro His Met His Pro 
145 150 155 160 



His Met His Pro His His Pro Gly Gly Pro Met Gly His Pro His Gly 
165 170 175 



Pro His Pro His Met Gly Gly Pro Pro Pro Met Arg Gly Met Ser Pro 
180 185 190 



Met His Pro His Gln Met Gly Pro Gly Pro Gly Val Gly Leu Pro Pro 
195 200 205 



His Met Asn His Gly Arg Pro Gly Gly Pro Gly Gly Pro Gly Gly Pro 
210 215 220 



Val Pro Met Gly Ser Pro Met Gly Gly Ile Ala Gly Met Gly Gly Met 
225 230 235 240 



Ser Pro Met Gly Gly Met Gly Gly Pro Ser Ile Ser Pro His His Met 
245 250 255 
Gly Met Gly Gly Leu Ser Pro Met Gly Gly Gly Pro Asn Gly Pro Asn
260 265 270 



Pro Arg Ala Met Gln Gly Ser Pro Met Gly Gly Pro Gly Gln Asn Ser
275 280 285 



Pro Met Asn Ser Leu Pro Met Gly Ser Pro Met Gly Asn Pro Ile Gly
290 295 300 



Ser Pro Leu Gly Pro Pro Ser Gly Pro Gly Pro Gly Asn Pro Gly Asn 
305 310 315 320 



Thr Gly Gly Pro Gln Gln Gln Gln Gln Gln Pro Pro Gln Pro Pro Met

325 330 335 



Asn Asn Gly Gln Met Gly Pro Pro Pro Leu His Ser Pro Leu Gly Asn
340 345 350 



Gly Pro Thr Gly His Gly Ser His Met Pro Gly Gly Pro Ile Pro Gly
355 360 365 



Pro Gly Pro Gly Pro Gly Gly Leu Val Gly Pro Gly Gly Ile Ser Pro
370 375 380 



Ala His Gly Asn Asn Pro Gly Gly Ser Gly Asn Asn Met Leu Gly Gly 
385 390 395 400 



Asn Pro Gly Gly Gly Asn Ser Asn Asn Asn Gly Ser Asn Thr Ser Asn 
405 410 415 



Ala Ser Asn Asn Asn Gln Asn Pro His Leu Ser Pro Ala Ala Gly Arg
420 425 430 



Leu Gly Val Pro Thr Ser Met Gln Ser Asn Gly Pro Ser Val Ser Ser
435 440 445 



Val Ala Ser Ser Ser Val Pro Ser Pro Ala Thr Pro Thr Leu Thr Pro 
450 455 460 



Thr Ser Thr Ala Thr Ser Met Ser Thr Ser Val Pro Thr Ser Ser Pro 
465 470 475 480 



Ala Pro Pro Ala Met Ser Pro His His Ser Leu Asn Ser Ala Gly Pro 
485 490 495 



Ser Pro Gly Met Pro Asn Ser Gly Pro Ser Pro Leu Gln Ser Pro Ala
500 505 510



Gly Pro Asn Gly Pro Asn Asn Asn Asn Ser Asn Asn Asn Asn Gly Pro
515 520 525 



Met Met Gly Gln Met Ile Pro Asn Ala Val Pro Met Gln His Gln Gln
530 535 540 



His Met Gly Gly Gly Pro Pro Gly His Gly Pro Gly Pro Met Pro Gly 
545 550 555 560 



Met Gly Met Asn Gln Met Leu Pro Pro Gln Gln Pro Ser His Leu Gly
565 570 575 



Pro Pro His Pro Asn Met Met Asn His Pro His His Pro His His His 
580 585 590 



Pro Gly Gly Pro Pro Pro His Met Met Gly Gly Pro Gly Met His Gly 
595 600 605 



Gly Pro Ala Gly Met Pro Pro His Met Gly Gly Gly Pro Asn Pro His 
610 615 620 



Met Met Gly Gly Pro His Gly Asn Ala Gly Pro His Met Gly His Gly 
625 630 635 640 



His Met Gly Gly Val Pro Gly Pro Gly Pro Gly Pro Gly Gly Met Asn 
645 650 655 



Gly Pro Pro His Pro His Met Ser Pro His His Gly His Pro His His 
660 665 670 



His His Asn Pro Met Gly Gly Pro Gly Pro Asn Met Phe Gly Gly Gly 
675 680 685 



Gly Gly Gly Pro Met Gly Pro Gly Gly Pro Met Gly Asn Met Gly Pro 
690 695 700 



Met Gly Gly Gly Pro Met Gly Gly Pro Met Gly Val Gly Pro Lys Pro 
705 710 715 720 



Met Thr Met Gly Gly Gly Lys Met Tyr Pro Pro Gly Gln Pro Met Val
725 730 735 



Phe Asn Pro Gln Asn Pro Asn Ala Pro Pro Ile Tyr Pro Cys Gly Met
740 745 750 
Cys His Lys Glu Val Asn Asp Asn Asp Glu Ala Val Phe Cys Glu Ser
755 760 765 



Gly Cys Asn Phe Phe Phe His Arg Thr Cys Val Gly Leu Thr Glu Ala 
770 775 780 



Ala Phe Gln Met Leu Asn Lys Glu Val Phe Ala Glu Trp Cys Cys Asp
785 790 795 800 



Lys Cys Val Ser Ser Lys His Ile Pro Met Val Lys Phe Lys Cys
805 810 815 



<210> 7 

<211> 419 

<212> PRT 

<213> Human 

<400> 7 

Met Pro Ala Glu Asn Ser Pro Ala Pro Ala Tyr Lys Val Ser Ser His 
1 5 10 15 



Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Gly Val Gln Leu
20 25 30 



Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Pro Ser
35 40 45 



Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp 
50 55 60 



His Leu Val Ala Ala Asn Pro Phe Asp Asp Asn Tyr Asn Thr Ile Ser
65 70 75 80 



Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro 
85 90 95 



Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro
100 105 110 



Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125 



His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140 



Ala Phe Asn Phe Gly Pro His Asp Asn Ser Ser Phe Gly Asn Pro Ser 
145 150 155 160 
Tyr Asn Asn Ala Leu Ser Gln Asn Val Asn Met Pro Asn Gln His Phe
165 170 175 



Arg Gln Asn Pro Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Ala
180 185 190 



Ser Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Val Pro Gly Asn
195 200 205 



Asn Ser Asn Phe Thr Ser Pro Leu Glu Ser Asn His Ser Phe Ile Pro
210 215 220 



Pro Pro Asn Thr Phe Gly Gln Ala Lys Ala Pro Pro Pro Lys Gln Asp
225 230 235 240 



Phe Thr Gln Gly Ala Thr Lys Asn Thr Asn Gln Asn Ser Ser Ala His
245 250 255 



Pro Pro His Leu Asn Met Asp Asp Thr Val Asn Gln Ser Asn Ile Glu
260 265 270 



Leu Lys Asn Val Asn Arg Asn Asn Ala Val Asn Gln Glu Asn Ser Arg
275 280 285 



Ser Ser Ser Thr Glu Ala Thr Asn Asn Asn Pro Ala Asn Gly Thr Gln
290 295 300 



Asn Lys Pro Arg Gln Pro Arg Gly Ala Ala Asp Ala Cys Thr Thr Glu
305 310 315 320 



Lys Ser Asn Lys Ser Ser Leu His Pro Asn Arg His Gly His Ser Ser 
325 330 335 



Ser Asp Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp
340 345 350 



Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His
355 360 365 



Arg Ile Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala
370 375 380 



Glu Ala Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp 
385 390 395 400 



Val Gln Leu Met Arg Thr Arg Glu Thr Phe Gly Pro Ser Ala Val Gly
405 410 415 
Ser Asp Ala



<210> 8 

<211> 406 

<212> PRT 

<213> Human 

<400> 8 



Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Gly 
1 5 10 15 



Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30 



Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45 



Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60 



Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp 
65 70 75 80



Asp Phe Gly Ala Pro Lys Val Gly Val Ala Ala Pro Pro Phe Leu Gly 
85 90 95 



Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110 



Gln Val Pro Pro Gly Tyr Ser Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125 



Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Asn Pro Met Gly Pro Ala
130 135 140 



Phe Asn Met Pro Pro Gln Gly Pro Gly Tyr Pro Pro Pro Gly Asn Met
145 150 155 160 



Asn Phe Pro Ser Gln Pro Phe Asn Gln Pro Leu Gly Gln Asn Phe Ser
165 170 175 



Pro Pro Ser Gly Gln Met Met Pro Gly Pro Val Gly Gly Phe Gly Pro
180 185 190 



Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Ala Glu Leu Gly Pro
195 200 205 
Pro Ser Leu Ser Gln Arg Phe Ala Gln Pro Gly Ala Pro Phe Gly Pro
210 215 220 



Ser Pro Leu Gln Arg Pro Gly Gln Gly Leu Pro Ser Leu Pro Pro Asn
225 230 235 240 



Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly 
245 250 255 



Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Ser Thr Ala Phe Pro 
260 265 270 



Gln Glu Pro His Ser Gly Ser Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285 



Pro Ser Phe Pro Pro Asn Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp 
290 295 300 



Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro 
305 310 315 320 



Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335 



Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350 



Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu 
355 360 365 



Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys 
370 375 380 



Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400 



Val Ala Ala Asn Asp Gly 
405 



<210> 9 

<211> 417 

<212> PRT 

<213> Mouse 

<400> 9 

Met Ser Ala Glu Gln Asp Lys Glu Pro Ile Ala Leu Lys Arg Val Arg
1 5 10 15

Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Asn Ile Gln Leu
20 25 30 



Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Ser Ser
35 40 45 



Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp 
50 55 60 



His Leu Val Ala Ala Asn Pro Phe Asp Asp Ser Tyr Asn Thr Ile Ser
65 70 75 80 



Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro 
85 90 95 



Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro 
100 105 110 



Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125 



His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140 



Ala Phe Asn Phe Gly Pro His Asp Asn Ser Asn Phe Gly Asn Pro Pro 
145 150 155 160 



Tyr Asn Asn Val Leu Thr Gln Asp Ile Asn Met Pro Gly Gln His Phe
165 170 175 



Arg Gln Gly Ser Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Val
180 185 190



Gly Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Ala Pro Gly Asn
195 200 205 



Asn Ser Asn Phe Thr Ser Pro Leu Glu Thr Asn His Ser Phe Ile Pro
210 215 220 



Pro Pro Asn Ala Phe Gly Gln Ala Lys Ala Pro Leu Pro Lys Gln Asp
225 230 235 240 



Phe Thr Gln Gly Ala Thr Lys Thr Pro Asn Gln Asn Ser Ser Thr His
245 250 255 



Pro Pro His Leu Asn Met Glu Asp Pro Val Asn Gln Ser Asn Val Glu
260 265 270 
Leu Lys Asn Val Asn Arg Asn Asn Val Val Gln Glu Asn Ser Arg Ser
275 280 285 



Gly Ser Ala Glu Ala Thr Asn Asn His Ala Asn Gly Thr Gln Asn Lys
290 295 300 



Pro Arg Gln Pro Arg Gly Ala Ala Asp Leu Cys Thr Pro Asp Lys Ser
305 310 315 320 



Arg Lys Phe Ser Leu Leu Pro Ser Arg His Gly His Ser Ser Ser Asp 
325 330 335 



Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp Asp Gln
340 345 350 



Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His Arg Ile
355 360 365 



Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala Glu Ala 
370 375 380 



Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp Val Gln
385 390 395 400 



Leu Met Arg Thr Arg Glu Ala Phe Gly Pro Pro Ala Val Gly Gly Asp 
405 410 415 



Ala 



<210> 10 

<211> 406 

<212> PRT 

<213> Mouse 

<400> 10 

Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Ser 
1 5 10 15 



Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30 



Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45 



Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60 
Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp
65 70 75 80 



Asp Phe Gly Ala Pro Lys Val Gly Gly Ala Gly Pro Pro Phe Leu Gly 
85 90 95 



Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110



Gln Val Pro Pro Ser Tyr Gly Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125 



Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Ser Pro Met Gly Pro Ala
130 135 140 



Phe Asn Met Pro Pro Gln Gly Pro Trp Gly Thr Pro Pro Pro Gly Asn
145 150 155 160



Met Asn Phe Pro Ser Gln Pro Phe Asn Gln Ser Leu Gly Gln Asn Phe
165 170 175 



Ser Pro Pro Gly Gly Gln Val Met Pro Gly Pro Val Gly Gly Phe Gly
180 185 190 



Pro Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Gly Glu Leu Gly
195 200 205 



Pro Pro Pro Leu Pro Gln Arg Phe Thr Gln Pro Gly Ala Pro Tyr Gly
210 215 220 



Pro Ser Leu Gln Arg Pro Gly Gln Gly Leu Thr Gln Leu Pro Ser Asn
225 230 235 240 



Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly 
245 250 255 



Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Pro Thr Ala Phe Pro 
260 265 270 



Gln Glu Ala Pro Phe Gly Leu Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285 



Pro Ser Phe Pro Pro Ser Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp 
290 295 300 



Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro 
305 310 315 320



Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335 



Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350 



Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu 
355 360 365 



Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys 
370 375 380 



Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400 



Val Ala Ala Asn Asp Gly 
405 



